Hi all,
We currently use Telemac2D v5p9 at Hydro-Quebec on a 120-cpu cluster. I thought I would share with you some numbers on the parallel performance of T2D.
The benchmark was performed on Xeon 5570 2.93 GHz machines (8 procs per node) connected through an InfiniBand switch. The sample case is a 264000-element mesh (133000 nodes). Simulation time is 1800 s with a 1 s time step.
Calculation times are as follows:
nProcs tcalc (s)
2 1883
4 758
8 515
16 272
24 156
32 107
40 86
48 68
56 59
64 44
72 39
80 57
88 51
The attached figure shows relative calculation time as a function of nProcs. The dashed line is the theoretical (ideal) relative time, with the reference time set to 1.0 for our first test on 2 procs. Under ideal scaling, calc time is halved from 2 procs to 4 procs, divided by 4 from 2 to 8, and so on.
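For anyone who wants to reproduce the curve without the figure, here is a minimal Python sketch of the calculation described above. The timings are copied from the table in this post. The names `timings` and `scaling_table` are only illustrative and are not part of Telemac. Ideal relative time is taken to be 2/nProcs, which is what the halving rule implies for a 2-proc reference.

```python
# Wall-clock times from the table in this post: nProcs -> tcalc (s).
timings = {2: 1883, 4: 758, 8: 515, 16: 272, 24: 156, 32: 107, 40: 86,
           48: 68, 56: 59, 64: 44, 72: 39, 80: 57, 88: 51}

REF_PROCS = 2  # reference run; its relative time is 1.0 by definition


def scaling_table(timings, ref_procs=REF_PROCS):
    """Return rows of (nprocs, measured relative time, ideal relative time)."""
    t_ref = timings[ref_procs]
    rows = []
    for nprocs in sorted(timings):
        measured = timings[nprocs] / t_ref  # relative to the 2-proc run
        ideal = ref_procs / nprocs          # perfect scaling from the reference
        rows.append((nprocs, measured, ideal))
    return rows


rows = scaling_table(timings)
print(f"{'nProcs':>6} {'measured':>9} {'ideal':>9}")
for nprocs, measured, ideal in rows:
    print(f"{nprocs:>6d} {measured:>9.3f} {ideal:>9.3f}")

# Process count with the shortest measured calculation time.
fastest = min(timings, key=timings.get)
print("fastest run:", fastest, "procs")
```

Plotting the `measured` column against the `ideal` column should give the same kind of comparison as the attached figure.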
The figure shows that T2D's performance is very close to the relative times we were expecting. On our current setup, there is no performance gain beyond 72 procs: the 80- and 88-proc runs (57 s and 51 s) are slower than the 72-proc run (39 s). This is probably due to the increased communication overhead.
We would be very interested to see what others are getting. For the sake of comparison, I will soon post some numbers for the Malpasset case.