[mvapich-discuss] problem with running mvapich
biswajit at crlindia.com
biswajit at crlindia.com
Tue Jun 17 06:54:45 EDT 2008
When I ran a simple MPI application with mvapich2-1.0.2, I got the
following error messages:
Unknown Mellanox PCI-Express HCA best guess as Mellanox PCI-Express SDR
[3] Abort: Not enough ports are in active stateneeded active ports 1
at line 424 in file rdma_iba_priv.c
rank 3 in job 1 n23_32790 caused collective abort of all ranks
exit status of rank 3: return code 252
But there is a active port in each node. See the below 'ibstat' output.
CA 'mthca0'
CA type: MT25204
Number of ports: 1
Firmware version: 1.1.0
Hardware version: a0
Node GUID: 0x0019bbfffff70cb8
System image GUID: 0x0019bbfffff70cbb
Port 1:
State: Down
Physical state: Polling
Rate: 10
Base lid: 0
LMC: 0
SM lid: 0
Capability mask: 0x02510a68
Port GUID: 0x0019bbfffff70cb9
CA 'mthca1'
CA type: MT25204
Number of ports: 1
Firmware version: 1.1.0
Hardware version: a0
Node GUID: 0x0019bbfffff7fbe8
System image GUID: 0x0019bbfffff7fbeb
Port 1:
State: Active
Physical state: LinkUp
Rate: 20
Base lid: 226
LMC: 0
SM lid: 117
Capability mask: 0x02510a68
Port GUID: 0x0019bbfffff7fbe9
And, whenever I run same job in nodes with IB port 1 active, it works
properly.
Is there any option in MVAPICH to select the IB port which should be used
?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20080617/b6b609f4/attachment-0001.html
More information about the mvapich-discuss
mailing list