Hi Thierry and others,
I have been attempting a 3-way coupling for a large coastal domain, with tides, waves and sediments including bed-roughness feedback, using v7p2r0 on HPC Wales.
My initial run (12 hr duration) ends correctly and the solution files for T2D, Tom & Sis are fine. [The only caveat here is that any attempt to initialise Sisyphe with a PREVIOUS SEDIMENTOLOGICAL COMPUTATION FILE appears to be ignored.]
However, my query here concerns continuation of the solution (COMPUTATION CONTINUED, NEXT COMPUTATION) which has been unsuccessful so far. On continuation the resumption time is identified correctly (i.e. at 12hr, see output error 1 attached for a sequential continuation run). However, the run fails during its call from wac.F to ecrspe.f, and specifically, I think, at the call made from here to P_SYNCH.
The same error involving I/O to unit 17 occurs both for a sequential run (1 processor - see error 1 attached) and also for parallel continuation (e.g. with 16 processors). There are no external I/O files involved in any of T2D, Tom or Sis, though Sisyphe uses 4 Private Arrays.
If I comment out the calls to ecrspe.f, the continuation run then does make some progress:-
- For a sequential run (1 processor), three correct solution files are produced (t2d, tom & sis). But the global results file WACBRI is empty, meaning that no further continuation is possible.
- For a parallel run (16 processors), the continuation run again ends correctly. However, the results files are not moved out of the working directory and an error is generated ('Attempting to use an MPI routine before initializing MPI' - see error 2 attached). In this case, the sub-domain results are merged correctly for t2d (T2DRES) and tom (WACRES). However the gretel operation does not happen for Sisyphe (i.e. no SISRES), though the subdomain SIS solutions are available in the working directory. The global results file (WACBRI) is again empty following an error with runGRETEL. So in this parallel case there is no merged Sisyphe solution and again further continuation is not possible for tomawac.
I wonder whether anyone else has encountered this kind of issue, possibly involving the numbering of (tomawac) channel units in coupled mode? Any insights will be very much appreciated.
Kind regards, Alan