Hello,
I'm currently trying to use openTelemac with Intel compiler.
Compilation finish without errors but when I run an example in parallel, I have this error :
Abort(604611844) on node 0 (rank 0 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7f006aea4080, count=1000, MPI_BYTE, src=1, tag=524288, MPI_COMM_WORLD, request=0x7f01468608e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(269067524) on node 1 (rank 1 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7f7ecbf8d0c0, count=1000, MPI_BYTE, src=0, tag=524288, MPI_COMM_WORLD, request=0x7f7fa79378e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(403285252) on node 2 (rank 2 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7f9b1b6f5600, count=336, MPI_BYTE, src=3, tag=524288, MPI_COMM_WORLD, request=0x7f9bf70a48e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(632068) on node 3 (rank 3 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7f59ffb97040, count=592, MPI_BYTE, src=4, tag=524288, MPI_COMM_WORLD, request=0x7f5adb54b8e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(134849796) on node 4 (rank 4 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7fd01d4b2080, count=592, MPI_BYTE, src=3, tag=524288, MPI_COMM_WORLD, request=0x7fd0f8e648e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(671720708) on node 5 (rank 5 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7fe712c57a00, count=256, MPI_BYTE, src=6, tag=524288, MPI_COMM_WORLD, request=0x7fe7ee62e8e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(873047300) on node 6 (rank 6 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7f9a7a257040, count=464, MPI_BYTE, src=8, tag=524288, MPI_COMM_WORLD, request=0x7f9b55c248e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(873047300) on node 7 (rank 7 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7ff29902b040, count=504, MPI_BYTE, src=9, tag=524288, MPI_COMM_WORLD, request=0x7ff3749eb8e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(201958660) on node 8 (rank 8 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7f39be3feb00, count=752, MPI_BYTE, src=9, tag=524288, MPI_COMM_WORLD, request=0x7f3a99db88e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Abort(269067524) on node 9 (rank 9 in comm 0): Fatal error in PMPI_Irecv: Invalid tag, error stack:
PMPI_Irecv(162): MPI_Irecv(buf=0x7fa2c96e1600, count=752, MPI_BYTE, src=8, tag=524288, MPI_COMM_WORLD, request=0x7fa3a50958e0) failed
PMPI_Irecv(96).: Invalid tag, value is 524288
Traceback (most recent call last):
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/telemac2d.py", line 7, in <module>
main('telemac2d')
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/runcode.py", line 271, in main
run_study(cas_file, code_name, options)
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/execution/run_cas.py", line 157, in run_study
run_local_cas(my_study, options)
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/execution/run_cas.py", line 65, in run_local_cas
my_study.run(options)
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/execution/study.py", line 612, in run
self.run_local()
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/execution/study.py", line 440, in run_local
run_code(self.run_cmd, self.sortie_file)
File "/home/telemac/telemac-mascaret/v8p2r1/scripts/python3/execution/run.py", line 182, in run_code
raise TelemacException('Fail to run\n'+exe)
utils.exceptions.TelemacException: Fail to run
mpirun -np 10 /home/telemac/telemac-mascaret-benchmark/modeles/malpasset/Q100.cas_2021-08-12-14h25min06s/out_user_fortran
It happens always at the same time regardless of ncsize value.
System info :
- CPU : Dual Intel Xeon Gold 6230R
- OS : Ubuntu 20.04
- opentelemac version : v8p2r1
- Systel and pysource : gitlab.nicodet.fr/-/snippets/7 or see attachment
openmpi config run just fine.