[mvapich-discuss] problem w/MVAPICH in the frames of Gen1

Mikhail Kuzminsky kus at free.net
Fri Aug 4 12:05:58 EDT 2006


In message from Sayantan Sur <surs at cse.ohio-state.edu> (Fri, 04 Aug 
2006 09:20:56 -0500):
>Mikail,
<skipped>

>Thanks for trying out our latest version on your cluster. Clearly, 
>the problem in both versions stems from the inability of the VAPI 
>(underlying IB layer) to create a protection domain. The error 
>indicates that when MVAPICH calls the function VAPI_alloc_pd(), the 
>function doesn't return success.

:-(

>
>My hunch is that your IB installation is not proper. In particular, 
>the kernel modules which support IB, might not be working well with 
>your kernel. Could you please verify from Mellanox that the kernel 
>you are using is infact supported by IBGD?

IBGD-1.8.0 (I'm using) has 2.4.21 support (formally for Rocks 
distribution, but I use SuSE). IBGD-1.8.2 works officially only w/2.6. 
 

Some time ago I upgraded from 1.6.1 (w/working MPI) to 1.8.0 
just by Mellanox staff reccomendation, and TCP/IP stack w/SDP works in 
1.8.0 OK. 

> Pasha, any thoughts?
>
>You may also try to run some benchmarks which use VAPI only, like 
>`perf_main' to check if they have the same error too.

No, perf_main for bandwith (-trc) gives normal 791.4 Mbytes/s.

Yours
Mikhail




More information about the mvapich-discuss mailing list