Hello
Telemac3D v8p2r0 compiled without errors using gfortran and METIS (compilation config files are attached). I have python 3.8.5 installed.
Telemac3D simulations run flawlessly using one core .
When I try to run in parallel using METIS I get this error:
====================================================================
... partitioning base files (geo, conlim, sections, zones and weirs)
+> /home/luke/telemac_v8p2r0/v8p2r0/builds/ubugfopenmpi/bin/partel < PARTEL.PAR >> partel_T3DGEO.log
/bin/sh: 1: cannot open PARTEL.PAR: No such file
Traceback (most recent call last):
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/telemac3d.py", line 7, in <module>
main('telemac3d')
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/runcode.py", line 271, in main
run_study(cas_file, code_name, options)
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/execution/run_cas.py", line 157, in run_study
run_local_cas(my_study, options)
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/execution/run_cas.py", line 31, in run_local_cas
my_study.partionning(options.use_link)
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/execution/study.py", line 404, in partionning
run_partition(parcmd, self.cas, g_geo, g_fmt_geo, g_conlim,
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/execution/run.py", line 51, in run_partition
run_partel(partel, geom, fmtgeom, conlim, ncsize, False,
File "/home/luke/telemac_v8p2r0/v8p2r0/scripts/python3/execution/run.py", line 132, in run_partel
log = "No log available check command:\n"+par_cmd
TypeError: can only concatenate str (not "list") to str
=====================================================================
I tried a few possible solutions suggested in some forum threads such as:
> sudo apt install python-is-python3 python3-dev python3-pip libopenmpi-dev
> /usr/bin/env python (make sure it points to python3)
> Including this line in my compilation .cfg file:
incs_all: I /usr/lib/x86_64-linux-gnu/openmpi/include
> deleted the following line friom the .cfg file:
par_cmdexec: <config>/partel < PARTEL.PAR >> <partel.log>
I found a suggestion* from Yugi that changed the error to:
[C3PO:174614] *** An error occurred in MPI_Waitall
[C3PO:174614] *** reported by process [2673737729,1]
[C3PO:174614] *** on communicator MPI_COMM_WORLD
[C3PO:174614] *** MPI_ERR_TRUNCATE: message truncated
[C3PO:174614] *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
[C3PO:174614] *** and potentially your MPI job)
*Suggestion was:
"in par_cmd replace PARTEL.PAR by <partel.par>
This is somthing that changed in python3."
I also made a few other attempts with different configs in the last 7 days without success.
Any insight is welcome.
Thank you.
Márcio