Welcome, Guest
Username: Password: Remember me

TOPIC: installation and run on cluster

installation and run on cluster 9 years 2 months ago #18296

  • c.coulet
  • c.coulet's Avatar
  • OFFLINE
  • Moderator
  • Posts: 3722
  • Thank you received: 1031
You have to add the option --hpc!

regards
Christophe
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18297

  • Gaeta
  • Gaeta's Avatar
where?
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18298

  • c.coulet
  • c.coulet's Avatar
  • OFFLINE
  • Moderator
  • Posts: 3722
  • Thank you received: 1031
in the command line
Christophe
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18299

  • Gaeta
  • Gaeta's Avatar
Maybe you refer to --mpi option (--hpc doesn't exist).
BTW, also with this option (--mpi) I got the error

|runCode: Fail to run
|mpiexec -machinefile MPI_HOSTFILE -n 1 out_WaveWind_VarS-T_G3
|~~~~~~~~~~~~~~~~~~
|**********************************************************
|
|Open MPI does not support recursive calls of mpiexec
|
|**********************************************************
|~~~~~~~~~~~~~~~~~~
Primary job terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.

mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

Process name: [[5273,1],0]
Exit code: 1

and in the error file
Driving: /cineca/prod/compilers/gnu/4.9.2/none/bin/gfortran -fopenmp -fconvert=big-endian -frecord-marker=4 -lpthread -v -l gfortran -lm -o WaveWind_VarS-T_G3 wacfort.o /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/tomawac/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/bief/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/damocles/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/parallel/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/special/homere_tomawac.a /galileo/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib/libmpi.a -I/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/include -pthread -I/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib -L/cineca/sysprod/pbs/12.3.0.143517/lib -Wl,-rpath -Wl,/cineca/sysprod/pbs/12.3.0.143517/lib -Wl,-rpath -Wl,/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib -Wl,--enable-new-dtags -L/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib -lmpi_usempif08 -lmpi_usempi_ignore_tkr -lmpi_mpifh -lmpi -l gfortran -l m -shared-libgcc
Using built-in specs.
COLLECT_GCC=/cineca/prod/compilers/gnu/4.9.2/none/bin/gfortran
COLLECT_LTO_WRAPPER=/galileo/prod/compilers/gnu/4.9.2/none/bin/../libexec/gcc/x86_64-unknown-linux-gnu/4.9.2/lto-wrapper
Target: x86_64-unknown-linux-gnu
Configured with: ./configure --prefix=/cineca/prod/compilers/gnu/4.9.2/none --enable-languages=c,c++,fortran --disable-multilib
Thread model: posix
gcc version 4.9.2 (GCC)
Reading specs from /galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../lib64/libgfortran.spec
rename spec lib to liborig
COLLECT_GCC_OPTIONS='-fopenmp' '-fconvert=big-endian' '-frecord-marker=4' '-v' '-o' 'WaveWind_VarS-T_G3' '-I' '/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/include' '-pthread' '-I' '/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib' '-L/cineca/sysprod/pbs/12.3.0.143517/lib' '-L/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib' '-shared-libgcc' '-mtune=generic' '-march=x86-64' '-pthread'
COMPILER_PATH=/galileo/prod/compilers/gnu/4.9.2/none/bin/../libexec/gcc/x86_64-unknown-linux-gnu/4.9.2/:/galileo/prod/compilers/gnu/4.9.2/none/bin/../libexec/gcc/
LIBRARY_PATH=/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/:/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/:/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../lib64/:/lib/../lib64/:/usr/lib/../lib64/:/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../:/lib/:/usr/lib/
Reading specs from /galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../lib64/libgomp.spec
COLLECT_GCC_OPTIONS='-fopenmp' '-fconvert=big-endian' '-frecord-marker=4' '-v' '-o' 'WaveWind_VarS-T_G3' '-I' '/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/include' '-pthread' '-I' '/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib' '-L/cineca/sysprod/pbs/12.3.0.143517/lib' '-L/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib' '-shared-libgcc' '-mtune=generic' '-march=x86-64' '-pthread'
/galileo/prod/compilers/gnu/4.9.2/none/bin/../libexec/gcc/x86_64-unknown-linux-gnu/4.9.2/collect2 --eh-frame-hdr -m elf_x86_64 -dynamic-linker /lib64/ld-linux-x86-64.so.2 -o WaveWind_VarS-T_G3 /lib/../lib64/crt1.o /lib/../lib64/crti.o /galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/crtbegin.o -L/cineca/sysprod/pbs/12.3.0.143517/lib -L/cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib -L/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2 -L/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc -L/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../lib64 -L/lib/../lib64 -L/usr/lib/../lib64 -L/galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../.. -lpthread -lgfortran -lm wacfort.o /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/tomawac/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/bief/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/damocles/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/parallel/homere_tomawac.a /galileo/home/userexternal/mgaeta00/Telemac/svn.opentelemac.org/svn/opentelemac/tags/v6p3r2/builds/cinecagalileoopenmpi_hpc/lib/utils/special/homere_tomawac.a /galileo/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib/libmpi.a -rpath /cineca/sysprod/pbs/12.3.0.143517/lib -rpath /cineca/prod/compilers/openmpi/1.8.4/gnu--4.9.2/lib --enable-new-dtags -lmpi_usempif08 -lmpi_usempi_ignore_tkr -lmpi_mpifh -lmpi -lgfortran -lm -lgomp -lgcc_s -lgcc -lquadmath -lm -lgcc_s -lgcc -lpthread -lc -lgcc_s -lgcc /galileo/prod/compilers/gnu/4.9.2/none/bin/../lib/gcc/x86_64-unknown-linux-gnu/4.9.2/crtend.o /lib/../lib64/crtn.o

Helps?
Attachments:
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18300

  • c.coulet
  • c.coulet's Avatar
  • OFFLINE
  • Moderator
  • Posts: 3722
  • Thank you received: 1031
I wasn't enough clear in my explanation
The command line in the configuration file is ok, don't change anything

When you run a computation add --hpc in the line
something like:
runcode.py tomawac -s --hpc cas_tom --walltime=2:00:00 --queue=parallel --ncsize=2 --ncnode=1 --jobname=job_T

regards
Christophe
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18301

  • Gaeta
  • Gaeta's Avatar
As I said, I got the following:

use -h for more help.

runcode.py: error: no such option: --hpc

And should I omit the command "mpirun" before runcode.py etc....?
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18302

  • yugi
  • yugi's Avatar
  • OFFLINE
  • openTELEMAC Guru
  • Posts: 851
  • Thank you received: 244
You need to change in your systel.cfg the commande "mpiexec" by "mpirun".
That should solve the problem.

Hope it helps.
There are 10 types of people in the world: those who understand binary, and those who don't.
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18304

  • c.coulet
  • c.coulet's Avatar
  • OFFLINE
  • Moderator
  • Posts: 3722
  • Thank you received: 1031
Sorry for my wrong advice!
The --hpc option was available in the past. I used it last year to run parallel computation on our cluster.
I didn't noticed it disappear!

Regards
Christophe
The administrator has disabled public write access.
The following user(s) said Thank You: Gaeta

installation and run on cluster 9 years 2 months ago #18303

  • Gaeta
  • Gaeta's Avatar
not yet solved, also with mpirun in the conf file.
I attached again the systel.cfg and the batch script.
Sorry, this problem is bigger than my small IT knowledge..
Thanks.
If you have the availability to run my case, I can also send you all the steering files.

G
Attachments:
The administrator has disabled public write access.

installation and run on cluster 9 years 2 months ago #18305

  • yugi
  • yugi's Avatar
  • OFFLINE
  • openTELEMAC Guru
  • Posts: 851
  • Thank you received: 244
Hi,

Could you try to copy to two file attached to this post into the configs/ folder (You will need to rename the pysource by replacing '_sh.txt' by '.sh' i had to do that because the extension was not accepted by the forum).

Then run the command
source pysource.cinecagalileompi_hpc.sh.

Then try to rerun your test.

And if it still is'nt working can you give me the output and the command you used to run the joab as weel as the HPC_SDTIN (i.e the batch script) that will be generated.

Hope it helps.
Attachments:
There are 10 types of people in the world: those who understand binary, and those who don't.
The administrator has disabled public write access.
Moderators: borisb

The open TELEMAC-MASCARET template for Joomla!2.5, the HTML 4 version.