[mvapich-discuss] MVAPICH2-2.3b hwloc topology issue

Doug Johnson djohnson at osc.edu
Mon Nov 27 14:26:36 EST 2017


Hi,

I'm testing 2.3b and have encountered a problem with the new hwloc
topology routines in src/mpid/ch3/channels/common/src/affinity/hwloc_bind.c.
Simple tests fail to start, hanging due to missing files under /tmp.
The strace command shows the following.

access("/tmp/mv2-hwloc-kvs_28818_0-o0433.ten.osc.edu-6758.xml", F_OK) = -1 ENOENT (No such file or directory)

The xml file is missing on all the nodes where I'm attempting to launch
the job.  Are there possibly dependencies that I'm missing for this file
to be created successfully? (don't have time to step through the code.)

I was able to continue my tests successfully by setting
MV2_BCAST_HWLOC_TOPOLOGY=0.

Output from mpichversion looks like the following.

MVAPICH2 Version:       2.3b
MVAPICH2 Release date:  Thu Aug 10 22:00:00 EST 2017
MVAPICH2 Device:        ch3:mrail
MVAPICH2 configure:     --prefix=/opt/mvapich2/intel/17.0/2.3b --enable-shared --with-mpe --enable-romio --enable-mpit-pvars=mv2 --disable-option-checking --with-file-system=ufs+nfs+gpfs --with-pbs=/opt/torque --with-pbs-lib=/opt/torque/lib64 --with-pbs-include=/opt/torque/include
MVAPICH2 CC:    icc    -DNDEBUG -DNVALGRIND -O2
MVAPICH2 CXX:   icpc   -DNDEBUG -DNVALGRIND -O2
MVAPICH2 F77:   ifort -L/lib -L/lib   -O2
MVAPICH2 FC:    ifort   -O2


Best,
Doug


More information about the mvapich-discuss mailing list