@riadh :
A team provided me a test case, and I have to run it on my cluster.. I really don't know much about the scientific context of this case.
I hope the mesh is ok, but I really don't know.
In the file .out, I can see :
NOMBRE D'ELEMENTS: 1418656
NOMBRE REEL DE POINTS: 738888
I looked at the discussion you pointed, and it seems that I have what is needed : version 6.1 and when I checked the variable FMT4 in partel.f, it is set to I7 as it is recommended.
thanks a lot
@jmhervouet :
I tried to do all the modifications you told me, but there is a problem when I modify the parameter :
PROCESSEURS PARALLELES = 0
the error file for PROCESSEURS PARALLELES = 0 is:
forrtl: No such file or directory
forrtl: severe (29): file not found, unit 312, file /home/charpentierj/TestTelemacHPC4T2D/tours_28dec/cas.cas26909_tmp/..\strickler.txt
Image PC Routine Line Source
out26909_hpclr.ex 00000000009232ED Unknown Unknown Unknown
out26909_hpclr.ex 0000000000921DF5 Unknown Unknown Unknown
out26909_hpclr.ex 00000000008CC7C0 Unknown Unknown Unknown
out26909_hpclr.ex 00000000008810FA Unknown Unknown Unknown
out26909_hpclr.ex 00000000008808F0 Unknown Unknown Unknown
out26909_hpclr.ex 000000000089363D Unknown Unknown Unknown
out26909_hpclr.ex 000000000042AF29 Unknown Unknown Unknown
out26909_hpclr.ex 00000000005E5DFB Unknown Unknown Unknown
out26909_hpclr.ex 0000000000442DA3 Unknown Unknown Unknown
out26909_hpclr.ex 000000000042BD81 Unknown Unknown Unknown
out26909_hpclr.ex 000000000042A01C Unknown Unknown Unknown
libc.so.6 00000036C261D994 Unknown Unknown Unknown
out26909_hpclr.ex 0000000000429F29 Unknown Unknown Unknown
Command exited with non-zero status 29
1.43user 0.42system 0:01.89elapsed 98%CPU (0avgtext+0avgdata 0maxresident)k
0inputs+0outputs (0major+332664minor)pagefaults 0swaps
## Erreur : Fin anormale : time ./out26909_hpclr.exe :7424
real 0m13.000s
user 0m2.079s
sys 0m0.632s
Then i tried with the value PROCESSEURS PARALLELES = 1
The error file is a little bit different but there is still a problem.
In addition, a file named mpirun.txt is created with 12 times the hosts name.
the error file for PROCESSEURS PARALLELES = 1 is :
forrtl: No such file or directory
forrtl: severe (29): file not found, unit 312, file /home/charpentierj/TestTelemacHPC4T2D/tours_28dec/cas.cas27118_tmp/..\strickler.txt
Image PC Routine Line Source
out27118_hpclr.ex 0000000000962FED Unknown Unknown Unknown
out27118_hpclr.ex 0000000000961AF5 Unknown Unknown Unknown
out27118_hpclr.ex 000000000090C4C0 Unknown Unknown Unknown
out27118_hpclr.ex 00000000008C0DFA Unknown Unknown Unknown
out27118_hpclr.ex 00000000008C05F0 Unknown Unknown Unknown
out27118_hpclr.ex 00000000008D333D Unknown Unknown Unknown
out27118_hpclr.ex 0000000000431029 Unknown Unknown Unknown
out27118_hpclr.ex 00000000005EBEFB Unknown Unknown Unknown
out27118_hpclr.ex 0000000000448EA3 Unknown Unknown Unknown
out27118_hpclr.ex 0000000000431E81 Unknown Unknown Unknown
out27118_hpclr.ex 000000000043011C Unknown Unknown Unknown
libc.so.6 00000036C261D994 Unknown Unknown Unknown
out27118_hpclr.ex 0000000000430029 Unknown Unknown Unknown
mpirun has exited due to process rank 0 with PID 27145 on
node node052 exiting without calling "finalize". This may
have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
## Erreur : Fin anormale : cd /home/charpentierj/TestTelemacHPC4T2D/tours_28dec/cas.cas27118_tmp; mpirun out27118_hpclr.exe :7424
real 0m14.356s
user 0m2.334s
sys 0m0.688s
Then i tried to adapt the values in my submission script (#@TOTAL_TASKS=1) and in mpi_telemac.conf for both case (PROCESSEURS PARALLELES = 0 & PROCESSEURS PARALLELES = 1) but it doesn't change anything...
Do you have an idea of what can i do to solve this?
Best regards,
Johanne Charpentier