[mvapich-discuss] mvapich 1.0.1 Got error polling CQ

Pavel Shamis (Pasha) pashash at gmail.com
Tue Aug 4 04:10:51 EDT 2009


Jeff,
You can use the ibdiagnet Open Fabrics tool for IB network debug:
http://linux.die.net/man/1/ibdiagnet

Pasha

Jeff Haferman wrote:
> Thank you, you are correct... verbs level tests are failing.  They were
> working earlier, so something broke.
>
>
>
> Dhabaleswar Panda wrote:
>   
>> This error indicates that when a process is able to poll Completion Queue
>> (CQ) of InfiniBand ntwork, it is getting an error.
>>
>> On your IB set-up, are you able to carry out IB native-level (verbs-level,
>> not MPI-level) tests across the nodes. Please make sure that the IB set-up
>> is correct. Then you can carry out MPI-level tests.
>>
>> DK
>>
>> On Mon, 3 Aug 2009, Jeff Haferman wrote:
>>
>>     
>>> Hi -
>>> This is our first try of IB and mvapich, and yes I will install a more up-to-date version, but could someone explain the following:
>>>
>>> mpirun -np 2 -hostfile hostfile ./osu_latency
>>> Abort signaled by rank 0: [compute-0-0.local:0] Got error polling CQ
>>>
>>> Abort signaled by rank 1: Error polling CQ
>>>
>>> Exit code -3 signaled from compute-ib-0-0
>>> Killing remote processes...MPI process terminated unexpectedly
>>> MPI process terminated unexpectedly
>>> DONE
>>>
>>>
>>> This is mvapich 1.0.1 compiled with gnu 4.1.2 on Centos 5.2 with Linux kernel 2.6.18-92.1.26 and ofed 1.3.1
>>> What is "Error polling CQ"?  I done a search and read the manual but can't find anything helpful.
>>>
>>> Jeff
>>> _______________________________________________
>>> mvapich-discuss mailing list
>>> mvapich-discuss at cse.ohio-state.edu
>>> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>>>
>>>       
>
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
>   



More information about the mvapich-discuss mailing list