[mvapich-discuss] Re: Errors running mvapich2
Abhinav Vishnu
vishnu at cse.ohio-state.edu
Sat Sep 30 13:10:15 EDT 2006
Hi Amit,
>
> Hi Abhinav,
>
> I have been trying to send emails to the mvapich2-discussion list since
> yesterday. My email bounced back. I am not sure if there is some problem
> with your mailserver or you guys decided not to entertain me anymore :-).
>
We are sorry for the problem. As you might have noticed, it was caused
by the reconfiguration of our department's mail server.
>
> I am stuck with the following errors.
> ===============================================
> Hi MVAPICH2,
>
> Need help running mvapich2 programs. Running on Opteron & using GNU
> compilers.
>
> Using MVAPICH2-0-9.5
> I am trying to run osu_latency benchmark. I end up with the following
> error.
> Running it as a user. Not sure if this is some kind of permission problem ?
> I am using SilverStorm InfinIO 3000 switch and its VAPI library.
>
> #> mpiexec -machinefile machines -n 3 ./osu_latency.mvapich
> [rdma_iba_priv.c:586] error(-246): cannot query HCA
> rank 0 in job 3 compute-0-10.local_33860 caused collective abort of all
> ranks
> exit status of rank 0: killed by signal 9
>
This looks like a problem caused by the InfiniBand drivers not being
up. There is a verbs-level utility, perf_main, which communicates
below the MPI layer. Are you able to communicate using perf_main?
Please let us know the outcome.
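Before trying perf_main, a quick sanity check is to see whether any
InfiniBand-related kernel modules are loaded at all. The sketch below is
only an illustration: the function name check_ib_modules is made up for
this example, and the module name pattern (ib_ or mthca) is an
assumption. The SilverStorm stack may well use different module names,
so please adjust the pattern to whatever your installation documents.

```shell
#!/bin/sh
# Sketch: report whether any loaded kernel module matches a name pattern.
# ASSUMPTION: the default pattern below is a guess; vendor stacks differ.

check_ib_modules() {
    pattern="$1"                      # extended regex of module names
    modules_file="${2:-/proc/modules}" # list of loaded modules on Linux

    # The first column of /proc/modules is the module name.
    if awk '{print $1}' "$modules_file" 2>/dev/null | grep -E -q "$pattern"; then
        echo "loaded"
    else
        echo "not loaded"
    fi
}

# Hypothetical default pattern; pass your own as the first argument.
check_ib_modules "${1:-ib_|mthca}"
```

If this prints "not loaded", the driver stack is probably not up on that
node, which would be consistent with the "cannot query HCA" error.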
Thanks and regards,
-- Abhinav
>
> Thank you for any feedback or help,
> -Amit
> =================================================
>
>