[mvapich-discuss] mvapich2-gdr
Thomas Geenen
thomas.geenen at surfsara.nl
Wed Aug 27 13:59:10 EDT 2014
hi DK,
yes thats much better.
seems that i will have to do some tweaking to get similar results compared to openmpi for medium sized messages.
that will be something for tomorrow
cheers
Thomas Geenen
| Senior adviseur Supercomputing | SURFsara | Science Park 140 | 1098 XG Amsterdam | T 06 16923022 | thomas.geenen at surfsara.nl | www.surfsara.nl |
Aanwezig op di | woe | do | vr |
[SURF_SARA_fc]
________________________________
From: Panda, Dhabaleswar [panda at cse.ohio-state.edu]
Sent: Wednesday, August 27, 2014 5:52 PM
To: Thomas Geenen; mvapich-discuss at cse.ohio-state.edu
Cc: Panda, Dhabaleswar
Subject: RE: mvapich2-gdr
Hi Thomas,
MVAPICH2-GDR (with GPU Direct support) is a different package than the standard MVAPICH2 release.
The standard MVAPICH2 release does not have GPU Direct RDMA support.
Please download the latest MVAPICH2-GDR 2.0 package from MVAPICH->Download
page and use it according to the README file available inside the package (or from the
following URL) to use the GPU Direct RDMA functionality:
http://mvapich.cse.ohio-state.edu/static/media/mvapich/MV2-GDR-README.txt
Hope this helps.
Thanks,
DK
________________________________
From: mvapich-discuss-bounces at cse.ohio-state.edu on behalf of Thomas Geenen [thomas.geenen at surfsara.nl]
Sent: Wednesday, August 27, 2014 11:39 AM
To: mvapich-discuss at cse.ohio-state.edu
Subject: [mvapich-discuss] mvapich2-gdr
i try to use mvapich2-gdr on our cluster.
we have it setup with
ofed 2.1
cuda 6.5
nvidia driver 340.29
nv_peer_mem loaded
with openmpi 1.8.1 we get the expected bandwidth an latency values runing the osu_bw and osu_latency with -d cuda D D
with mvapich we get the values we get when running openmpi without gpudirect
i get the sources from
https://scm.mvapich.cse.ohio-state.edu/svn/mpi/mvapich2
i configure with
./configure --prefix=/home/thomasge/build/mvapich-tag-2.0 --enable-cuda --with-cuda-inclu
de=/hpc/sw/cuda/6.5.14/include --with-cuda-libpath=/hpc/sw/cuda/6.5.14/lib64
then i run as follows
export CUDA_VISIBLE_DEVICES=0
export MV2_USE_CUDA=1
export MV2_USE_GPUDIRECT=1
export MV2_RAIL_SHARING_POLICY=FIXED_MAPPING
export MV2_PROCESS_TO_RAIL_MAPPING=mlx4_0
export MV2_CPU_BINDING_LEVEL=SOCKET
export MV2_CPU_BINDING_POLICY=SCATTER
ulimit -l unlimited
mpirun -np 2 ./osu_latency -d cuda D D > osu_latency.out 2>&1
obviously i am overlooking something
best
Thomas Geenen
| Senior adviseur Supercomputing | SURFsara | Science Park 140 | 1098 XG Amsterdam | T 06 16923022 | thomas.geenen at surfsara.nl | www.surfsara.nl |
Aanwezig op di | woe | do | vr |
SURFsara heeft een nieuw algemeen telefoonnummer: 020 800 1300
[SURF_SARA_fc]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20140827/1aa245ef/attachment.html>
More information about the mvapich-discuss
mailing list