[mvapich-discuss] core dump on mpi_init with ofed 2

David Minor david-m at orbotech.com
Wed Aug 1 02:24:25 EDT 2007


Hello Wei,
ib_rdma_lat and ib_rdma_bw both work, and the paths to OFED 1.2 are correct. Remember, I see the same problem with both the mvapich that comes with OFED and the one I compiled from the 1.0 beta. What I haven't tried is compiling with uDAPL support. I'm using uDAPL successfully with Intel MPI.
Regards,
David

-----Original Message-----
From: Abhinav Vishnu [mailto:vishnu at cse.ohio-state.edu] 
Sent: Tuesday, July 31, 2007 5:48 PM
To: David Minor
Cc: wei huang; mvapich-discuss at cse.ohio-state.edu
Subject: Re: [mvapich-discuss] core dump on mpi_init with ofed 2

Hi David,

Thanks for this information. Based on it, I suspect a problem with the
setup. I think the following steps may help us narrow down the problem:

1. Are you able to run the verbs-level tests (ib_rdma_lat, ib_rdma_bw,
etc.) between the two nodes?

2. Please check the path of the OFED libraries that your compilation
script links against. I hope that you are recompiling your programs with
your OFED 1.2 MPI installation.
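
One way to check step 2 is to look at what `ldd` reports for the compiled
MPI executable. Below is a minimal Python sketch, assuming the usual
`name => path (address)` layout of `ldd` output. The function name
`ofed_libs` and the list of library-name fragments are hypothetical and
only for illustration; adjust them to your installation.

```python
import re

# Library-name fragments to look for. This list is an assumption for
# illustration, not an official list of OFED or MVAPICH2 libraries.
PATTERNS = ("ibverbs", "rdmacm", "ibumad", "mpich")

# Matches lines such as:
#   libibverbs.so.1 => /usr/lib64/libibverbs.so.1 (0x00002aaaaacc6000)
#   librdmacm.so.1 => not found
_LDD_LINE = re.compile(
    r"^\s*(\S+)\s+=>\s+(.*?)(?:\s+\(0x[0-9a-fA-F]+\))?\s*$"
)


def ofed_libs(ldd_output):
    """Map each matching library name to the path ldd resolved it to."""
    found = {}
    for line in ldd_output.splitlines():
        m = _LDD_LINE.match(line)
        if m is None:
            continue  # lines without "=>" (e.g. the dynamic loader) are skipped
        name, path = m.group(1), m.group(2)
        if any(p in name for p in PATTERNS):
            found[name] = path
    return found
```

Feed it the text printed by running `ldd` on the executable. Any entry
that resolves outside the OFED 1.2 installation directory, or that shows
"not found", would point to a mismatched library path.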

Please let us know the outcome of your experimentation.

Thanks,

:- Abhinav


> Hi Wei,
> I'm using 1.0 beta, but the same problem occurs with 0.9.8, both p3 and the version that comes with the OFED 1.2 release. I compiled using the make.mvapich2.ofa option. I haven't specified any environment variables. I didn't change any of the scripts, except to set PREFIX before compiling. I reproduced the problem with a trivial program running on 2 nodes. I didn't see the problem when running over Ethernet with 0.9.8.
> Thanks,
> David
>
> -----Original Message-----
> From: wei huang [mailto:huanwei at cse.ohio-state.edu] 
> Sent: Monday, July 30, 2007 4:36 PM
> To: David Minor
> Cc: mvapich-discuss at cse.ohio-state.edu
> Subject: Re: [mvapich-discuss] core dump on mpi_init with ofed 2
>
> Hi David,
>
> Thanks for letting us know about the problem. Could you please give us
> more information so we can look into it?
>
> 1) Which version of mvapich2 are you using? Is it 0.9.8? The latest
> version for 0.9.8 is mvapich2-0.9.8p3. Also, we have just released
> mvapich2-1.0-beta. You are welcome to try these two versions and let us
> know if your problem is reproducible there.
>
> 2) Are you using native ib verbs or udapl?
>
> 3) Have you specified any environment variables?
>
> 4) Did you use our default compiling scripts? Or did you make any changes
> to the scripts?
>
> 5) On how many processes do you see the problem? How many processes per
> physical node?
>
> Thanks.
>
> Regards,
> Wei Huang
>
>   


