[mvapich-discuss] mvapich 1.0.1 Got error polling CQ
Pavel Shamis (Pasha)
pashash at gmail.com
Tue Aug 4 04:10:51 EDT 2009
Jeff,
You can use the OpenFabrics ibdiagnet tool to debug the IB network:
http://linux.die.net/man/1/ibdiagnet
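
As a rough sketch of a first diagnostic pass (an illustration, not a prescribed procedure), the script below runs a few standard OFED tools in order and skips any that are not installed. The helper name run_if_present is made up for this example, and which tools are on your PATH depends on how OFED was installed.

```shell
#!/bin/sh
# run_if_present TOOL [ARGS...]
# Hypothetical helper: run TOOL if it is on PATH, otherwise say it was skipped.
run_if_present() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "== $* =="
        "$@"
    else
        echo "skip: $1 not found on PATH"
    fi
}

# Local HCA and port state; a healthy port reports PORT_ACTIVE.
run_if_present ibv_devinfo

# Link state as seen by infiniband-diags; expect State: Active.
run_if_present ibstat

# Fabric-wide scan; run it from a node attached to the fabric.
run_if_present ibdiagnet

# Verbs-level point-to-point test (not run here because it needs two nodes):
#   on the first node:   ibv_rc_pingpong
#   on the second node:  ibv_rc_pingpong <first-node-hostname>
```

If the local port is not active, or the pingpong test fails between two nodes, the problem is below MPI and should be fixed before rerunning osu_latency.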
Pasha
Jeff Haferman wrote:
> Thank you, you are correct... verbs level tests are failing. They were
> working earlier, so something broke.
>
>
>
> Dhabaleswar Panda wrote:
>
>> This error indicates that when a process polls the Completion Queue
>> (CQ) of the InfiniBand network, it gets an error.
>>
>> On your IB set-up, are you able to carry out IB native-level (verbs-level,
>> not MPI-level) tests across the nodes? Please make sure that the IB set-up
>> is correct. Then you can carry out MPI-level tests.
>>
>> DK
>>
>> On Mon, 3 Aug 2009, Jeff Haferman wrote:
>>
>>
>>> Hi -
>>> This is our first try of IB and mvapich, and yes I will install a more up-to-date version, but could someone explain the following:
>>>
>>> mpirun -np 2 -hostfile hostfile ./osu_latency
>>> Abort signaled by rank 0: [compute-0-0.local:0] Got error polling CQ
>>>
>>> Abort signaled by rank 1: Error polling CQ
>>>
>>> Exit code -3 signaled from compute-ib-0-0
>>> Killing remote processes...MPI process terminated unexpectedly
>>> MPI process terminated unexpectedly
>>> DONE
>>>
>>>
>>> This is mvapich 1.0.1 compiled with GNU 4.1.2 on CentOS 5.2 with Linux kernel 2.6.18-92.1.26 and OFED 1.3.1.
>>> What is "Error polling CQ"? I did a search and read the manual but can't find anything helpful.
>>>
>>> Jeff
>>> _______________________________________________
>>> mvapich-discuss mailing list
>>> mvapich-discuss at cse.ohio-state.edu
>>> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>>>
>>>