[mvapich-discuss] Re: Errors running mvapich2

Abhinav Vishnu vishnu at cse.ohio-state.edu
Sat Sep 30 13:10:15 EDT 2006


Hi Amit,

>
> Hi Abhinav,
>
> I have been trying to send emails to the mvapich2-discussion list since
> yesterday. My email bounced back. I am not sure if there is some problem
> with your mailserver or you guys decided not to entertain me anymore :-).
>

We are sorry for thr problem. As you might have noticed, there was a
problem due to the reconfiguration of our department's mail server.

>
> I am stuck with the following errors.
> ===============================================
> Hi MVAPICH2,
>
> Need help running mvapich2 programs. Running on Opteron & using GNU
> compilers.
>
> Using MVAPICH2-0-9.5
> I am trying to run osu_latency benchmark. I end up with the following
> error.
> Running it as a user. Not sure if this is some kind of permission problem ?
> I am using SilverStorm InfinIO 3000 switch and its VAPI library.
>
> #> mpiexec -machinefile machines -n 3 ./osu_latency.mvapich
> [rdma_iba_priv.c:586] error(-246): cannot query HCA
> rank 0 in job 3  compute-0-10.local_33860   caused collective abort of all
> ranks
>   exit status of rank 0: killed by signal 9
>

I think it looks like a problem due to the InfiniBand drivers not being
up. There is a verbs level utility perf_main which is used for
communication below MPI layer. Are you able to communicate using
perf_main? Please let us know the outcome.

Thanks and regards,

-- Abhinav

 >
> Thank you for any feedback or help,
> -Amit
> =================================================
>
>



More information about the mvapich-discuss mailing list