[mvapich-discuss] mvapich 1.0.1 Got error polling CQ

Dhabaleswar Panda panda at cse.ohio-state.edu
Mon Aug 3 21:19:58 EDT 2009


This error indicates that when a process is able to poll Completion Queue
(CQ) of InfiniBand ntwork, it is getting an error.

On your IB set-up, are you able to carry out IB native-level (verbs-level,
not MPI-level) tests across the nodes. Please make sure that the IB set-up
is correct. Then you can carry out MPI-level tests.

DK

On Mon, 3 Aug 2009, Jeff Haferman wrote:

>
> Hi -
> This is our first try of IB and mvapich, and yes I will install a more up-to-date version, but could someone explain the following:
>
> mpirun -np 2 -hostfile hostfile ./osu_latency
> Abort signaled by rank 0: [compute-0-0.local:0] Got error polling CQ
>
> Abort signaled by rank 1: Error polling CQ
>
> Exit code -3 signaled from compute-ib-0-0
> Killing remote processes...MPI process terminated unexpectedly
> MPI process terminated unexpectedly
> DONE
>
>
> This is mvapich 1.0.1 compiled with gnu 4.1.2 on Centos 5.2 with Linux kernel 2.6.18-92.1.26 and ofed 1.3.1
> What is "Error polling CQ"?  I done a search and read the manual but can't find anything helpful.
>
> Jeff
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>



More information about the mvapich-discuss mailing list