[mvapich-discuss] error in ibv_channel_manager.c
Ben
Benjamin.M.Auer at nasa.gov
Fri Aug 9 13:18:33 EDT 2013
I'm getting this error message from mvapich while running a program at a
large core count. It appears to occur during a read performed by the
root process. Does anyone have any ideas about what could be causing it?
Got FATAL event 0
at line 990 in file ibv_channel_manager.c
My run command is:
mpirun_rsh -hostfile $PBS_NODEFILE -np 7200 MV2_USE_UD_HYBRID=0 executable
and mpiname -a shows:
MVAPICH2 1.8.1 Thu Sep 27 18:55:23 EDT 2012 ch3:mrail
Compilation
CC: icc -fpic -m64 -DNDEBUG -DNVALGRIND -O2
CXX: icpc -fpic -m64 -DNDEBUG -DNVALGRIND -O2
F77: ifort -fpic -O2
FC: ifort -fpic -O2
Configuration
CC=icc CXX=icpc F77=ifort FC=ifort CFLAGS=-fpic -m64 CXXFLAGS=-fpic -m64
FFLAGS=-fpic FCFLAGS=-fpic --enable-f77 --enable-fc --enable-cxx
--enable-romio --enable-threads=default --with-hwloc
--disable-multi-aliases --enable-xrc=yes --enable-hybrid
--prefix=/usr/local/other/SLES11.1/mvapich2/1.8.1/intel-13.1.2.183
--
Ben Auer, PhD SSAI, Scientific Programmer/Analyst
NASA GSFC, Global Modeling and Assimilation Office
Code 610.1, 8800 Greenbelt Rd, Greenbelt, MD 20771
Phone: 301-286-9176 Fax: 301-614-6246