[mvapich-discuss] mvapich 1.0.1 Got error polling CQ

Jeff Haferman jeff at haferman.com
Mon Aug 3 22:38:16 EDT 2009


Thank you, you are correct: the verbs-level tests are failing.  They were
working earlier, so something broke.



Dhabaleswar Panda wrote:
> This error indicates that when a process polls the Completion Queue
> (CQ) of the InfiniBand network, it gets an error.
> 
> On your IB set-up, are you able to carry out IB native-level (verbs-level,
> not MPI-level) tests across the nodes? Please make sure that the IB set-up
> is correct. Then you can carry out MPI-level tests.
> 
> DK
> 
> On Mon, 3 Aug 2009, Jeff Haferman wrote:
> 
>>
>> Hi -
>> This is our first try of IB and mvapich, and yes I will install a more up-to-date version, but could someone explain the following:
>>
>> mpirun -np 2 -hostfile hostfile ./osu_latency
>> Abort signaled by rank 0: [compute-0-0.local:0] Got error polling CQ
>>
>> Abort signaled by rank 1: Error polling CQ
>>
>> Exit code -3 signaled from compute-ib-0-0
>> Killing remote processes...MPI process terminated unexpectedly
>> MPI process terminated unexpectedly
>> DONE
>>
>>
>> This is mvapich 1.0.1 compiled with GNU 4.1.2 on CentOS 5.2 with Linux kernel 2.6.18-92.1.26 and OFED 1.3.1.
>> What is "Error polling CQ"?  I did a search and read the manual but can't find anything helpful.
>>
>> Jeff
>>
> 
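
The verbs-level check suggested in the thread could be sketched roughly as follows. This is only an illustration, not what either poster actually ran. It assumes the OFED utilities ibv_devinfo and ibv_rc_pingpong are installed on each node. The function names port_is_active and check_local_port are hypothetical helpers introduced here.

```shell
#!/bin/sh
# Sketch of a verbs-level sanity check to run before any MPI-level test.
# Assumption: ibv_devinfo and ibv_rc_pingpong (shipped with OFED) are on PATH.

# Hypothetical helper: reads ibv_devinfo output on stdin and succeeds
# only if at least one port reports the PORT_ACTIVE state.
port_is_active() {
    grep -q 'PORT_ACTIVE'
}

# Hypothetical helper: step 1, run on each node.
# Confirms the adapter is visible and has an active port.
check_local_port() {
    ibv_devinfo | port_is_active
}

# Step 2 is a verbs-level ping-pong between two nodes, run by hand:
#   on the first node (server):   ibv_rc_pingpong
#   on the second node (client):  ibv_rc_pingpong <server-hostname>
# If this fails, the problem is below MPI and should be fixed there first.
```

After sourcing the script, running `check_local_port && echo "port active"` on each node is a quick first step; only when both nodes pass and the ping-pong completes is it worth returning to `mpirun ... ./osu_latency`.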


