[mvapich-discuss] Announcing the release of MVAPICH2 0.9.5 with
SRQ, integrated multi-rail and TotalView support
Pavel Shamis (Pasha)
pasha at mellanox.co.il
Wed Sep 6 11:10:06 EDT 2006
Your measurements are absolutely correct; the difference in IB send/recv
latency is much bigger. But in most cases in mvapich, small messages will
be sent via the fast_path, which is not affected by SRQ performance, is it not?
Thanks,
Pasha
Sayantan Sur wrote:
> I used the standard `perftest' distributed by OpenIB. In particular, the
> tests send_lat.c and send_bw.c seem to measure the latency and bandwidth
> of send/recv operations. I trivially modified them to use SRQ instead of
> posting receives to the queue pair's receive queue. The performance numbers
> are noted below. Based on these numbers, the SRQ overhead seems higher than
> the 2-3% threshold. May I request you post your Gen2 level comparison
> numbers? If there is something simple I'm missing, I'd like to correct it.
>
> Thanks,
> Sayantan.
>
> ===========
> Gen2 Latency (us)
> #Size    SRQ      Send/Recv
> 2        10.66    6.06
> 4        10.65    6.07
> 8        10.65    6.12
> 16       10.71    6.12
> 32       10.76    6.31
> 64       10.89    6.51
> 128      11.04    6.51
> 256      11.58    7.00
> 512      12.56    7.99
> 1024     13.87    9.40
>
> Gen2 Bandwidth (MB/s)
> #Size    SRQ      Send/Recv
> 2        0.14     0.81
> 4        0.29     1.68
> 8        0.57     3.26
> 16       1.14     6.68
> 32       2.34     13.46
> 64       4.55     26.08
> 128      9.06     51.99
> 256      18.22    105.34
> 512      36.44    214.36
> 1024     75.05    423.72
>
>
>
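As a rough check of the quoted numbers against the 2-3% figure, the relative
SRQ overhead can be computed directly from the two tables. The following is a
minimal Python sketch, not part of perftest or mvapich: the function and
variable names are illustrative assumptions, and the data is copied from the
quoted Gen2 tables.

```python
# Values copied from the quoted Gen2 tables: message size -> (SRQ, Send/Recv).
LATENCY_US = {
    2: (10.66, 6.06), 4: (10.65, 6.07), 8: (10.65, 6.12),
    16: (10.71, 6.12), 32: (10.76, 6.31), 64: (10.89, 6.51),
    128: (11.04, 6.51), 256: (11.58, 7.00), 512: (12.56, 7.99),
    1024: (13.87, 9.40),
}
BANDWIDTH_MBS = {
    2: (0.14, 0.81), 4: (0.29, 1.68), 8: (0.57, 3.26),
    16: (1.14, 6.68), 32: (2.34, 13.46), 64: (4.55, 26.08),
    128: (9.06, 51.99), 256: (18.22, 105.34), 512: (36.44, 214.36),
    1024: (75.05, 423.72),
}


def latency_overhead_pct(srq_us, sendrecv_us):
    """Extra SRQ latency as a percentage of the plain send/recv latency."""
    return (srq_us - sendrecv_us) / sendrecv_us * 100.0


def bandwidth_ratio(srq_mbs, sendrecv_mbs):
    """How many times higher plain send/recv bandwidth is than SRQ bandwidth."""
    return sendrecv_mbs / srq_mbs


# Print one summary line per message size.
for size in sorted(LATENCY_US):
    lat = latency_overhead_pct(*LATENCY_US[size])
    bw = bandwidth_ratio(*BANDWIDTH_MBS[size])
    print(f"{size:>5} B  latency +{lat:5.1f}%  send/recv bandwidth x{bw:4.2f}")
```

For every message size in the tables, the computed latency overhead is far
above 3%, which matches the observation in the quoted message.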
--
Pavel Shamis (Pasha)
Software Engineer
Mellanox Technologies LTD.
pasha at mellanox.co.il