[mvapich-discuss] segmentation fault

Hoot Thompson hoot at ptpnow.com
Mon Jun 18 15:50:06 EDT 2012


A little background, I've been working with Mellanox to get SR-IOV 
working between two Virtual Machines (VM). As of today, I have two real 
machines each with a VM and a virtual IB connection up between them. 
Logged into one of the VMs, I can ping and run the rdma_bw and rdma_lat 
tests between the VMs just fine. Attempts to run osu_bw (compiled with 
the Intel compiler), fails with the following................


[root at penguin1-vm1 mvapich2-1.8-r5435]# mpiexec -n 2 -hosts 
10.10.10.1,10.10.10.2 /root/osu/mvapich2-1.8-r5435/osu_benchmarks/osu_bw
[penguin1-vm1:mpi_rank_0][error_sighandler] Caught error: Segmentation 
fault (signal 11)
[pengui2-vm1:mpi_rank_1][error_sighandler] Caught error: Segmentation 
fault (signal 11)

=====================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   EXIT CODE: 139
=   CLEANING UP REMAINING PROCESSES
=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
=====================================================================================
[proxy:0:1 at pengui2-vm1] HYD_pmcd_pmip_control_cmd_cb 
(./pm/pmiserv/pmip_cb.c:955): assert (!closed) failed
[proxy:0:1 at pengui2-vm1] HYDT_dmxu_poll_wait_for_event 
(./tools/demux/demux_poll.c:77): callback returned error status
[proxy:0:1 at pengui2-vm1] main (./pm/pmiserv/pmip.c:226): demux engine 
error waiting for event
APPLICATION TERMINATED WITH THE EXIT STRING: Segmentation fault (signal 11)


Any thoughts?


Hoot


More information about the mvapich-discuss mailing list