[Mvapich-discuss] mvapich2-3.0a and slingshot 11 - performance questions
Ben Kirk
benkirk at ucar.edu
Wed May 3 15:30:00 EDT 2023
Hi, I've recently installed mvapich2-3.0a with the hopes of testing
slingshot 11 support.
When ./configuring with the recommended
--with-device=ch4:ofi --with-libfabric=/opt/cray/libfabric/1.15.2.0
I am able to build the library, but find even simple pt2pt performance
exceptionally slow. Comparing intra (inside 1 node) to inter (across 2
nodes):
#********* Intra-Node-CPU (Bare Metal) *****************
#/glade/work/benkirk/codes/dev_stack/install/osu-micro-benchmarks-6.2-mvapich2-3.0a-cray_libfabric-derecho-gcc-12.2.0/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_bw
# OSU MPI Bandwidth Test v6.2
# Size Bandwidth (MB/s)
1 2.90
2 6.10
4 11.44
8 24.28
16 46.73
32 93.20
64 194.77
128 346.13
256 759.24
512 1389.39
1024 2655.62
2048 5258.23
4096 9112.13
8192 12012.65
16384 11235.18
32768 10769.48
65536 19743.95
131072 29157.27
262144 27715.51
524288 27497.12
1048576 13662.50
2097152 10807.83
4194304 10830.14
#********* Inter-Node-CPU (Bare Metal) *****************
#/glade/work/benkirk/codes/dev_stack/install/osu-micro-benchmarks-6.2-mvapich2-3.0a-cray_libfabric-derecho-gcc-12.2.0/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_bw
# OSU MPI Bandwidth Test v6.2
# Size Bandwidth (MB/s)
1 0.30
2 0.60
4 1.20
8 2.40
16 4.78
32 9.52
64 17.50
128 34.09
256 54.02
512 101.24
1024 196.80
2048 195.56
4096 262.10
8192 316.01
16384 291.34
32768 289.64
65536 290.30
131072 260.94
262144 298.70
524288 342.67
1048576 295.31
2097152 351.07
4194304 334.36
Are there any debugging variables or other tricks I should be aware of?
Thanks!!
--
Ben Kirk
NCAR Computational & Information Systems Laboratory
Consulting Services Group Head
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osu.edu/pipermail/mvapich-discuss/attachments/20230503/d2555490/attachment-0006.html>
More information about the Mvapich-discuss
mailing list