[mvapich-discuss] problem with running mvapich

biswajit at crlindia.com biswajit at crlindia.com
Tue Jun 17 06:54:45 EDT 2008


When I ran a simple MPI application with mvapich2-1.0.2, I got the 
following error messages:

 
Unknown Mellanox PCI-Express HCA best guess as Mellanox PCI-Express SDR
[3] Abort: Not enough ports are in active stateneeded active ports 1
 at line 424 in file rdma_iba_priv.c
rank 3 in job 1  n23_32790   caused collective abort of all ranks
  exit status of rank 3: return code 252

But there is an active port on each node. See the 'ibstat' output below.


CA 'mthca0'
        CA type: MT25204
        Number of ports: 1
        Firmware version: 1.1.0
        Hardware version: a0
        Node GUID: 0x0019bbfffff70cb8
        System image GUID: 0x0019bbfffff70cbb
        Port 1:
                State: Down
                Physical state: Polling
                Rate: 10
                Base lid: 0
                LMC: 0
                SM lid: 0
                Capability mask: 0x02510a68
                Port GUID: 0x0019bbfffff70cb9
CA 'mthca1'
        CA type: MT25204
        Number of ports: 1
        Firmware version: 1.1.0
        Hardware version: a0
        Node GUID: 0x0019bbfffff7fbe8
        System image GUID: 0x0019bbfffff7fbeb
        Port 1:
                State: Active
                Physical state: LinkUp
                Rate: 20
                Base lid: 226
                LMC: 0
                SM lid: 117
                Capability mask: 0x02510a68
                Port GUID: 0x0019bbfffff7fbe9

Whenever I run the same job on nodes where IB port 1 is active, it works 
properly.
Is there an option in MVAPICH to select which IB port should be used?
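
For reference, a small script along these lines could be used to check which 
HCA and port are active on a node. It is only a sketch: it assumes 'ibstat' 
text in the same layout as the output above, and parse_ibstat and 
active_ports are hypothetical helper names, not part of MVAPICH or OFED.

```python
import re

def parse_ibstat(text):
    """Return (ca_name, port_number, state) tuples parsed from ibstat text."""
    ports = []
    ca = None
    port = None
    for raw in text.splitlines():
        line = raw.strip()
        m = re.match(r"CA '([^']+)'", line)
        if m:
            # New adapter section; reset the current port.
            ca, port = m.group(1), None
            continue
        m = re.match(r"Port (\d+):", line)
        if m:
            port = int(m.group(1))
            continue
        # Matches the logical "State:" line only, not "Physical state:".
        m = re.match(r"State:\s*(\S+)", line)
        if m and ca is not None and port is not None:
            ports.append((ca, port, m.group(1)))
    return ports

def active_ports(text):
    """Return (ca_name, port_number) pairs whose logical state is Active."""
    return [(ca, port) for ca, port, state in parse_ibstat(text)
            if state == "Active"]

# Abbreviated copy of the ibstat output from this node.
SAMPLE = """\
CA 'mthca0'
        Number of ports: 1
        Port 1:
                State: Down
                Physical state: Polling
CA 'mthca1'
        Number of ports: 1
        Port 1:
                State: Active
                Physical state: LinkUp
"""

print(active_ports(SAMPLE))
```

On a real node the text could come from running 'ibstat' itself, for example 
via subprocess.run(["ibstat"], capture_output=True, text=True).stdout, 
assuming the tool is on the PATH.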
