[mvapich-discuss] MVAPICH2-GDR/2.2-2 problems on RHEL 7.3 system
Raghu Reddy
raghu.reddy at noaa.gov
Fri Apr 21 08:30:04 EDT 2017
Hi Hari,
Carl Ponder found the following link that seems similar to what I am seeing now, but this was some time ago and was with RHEL 6:
http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/2014-February/004810.html
I was wondering if that may give you some clues?
Thanks,
Raghu
From: Raghu Reddy [mailto:raghu.reddy at noaa.gov]
Sent: Thursday, April 20, 2017 7:28 PM
To: 'Hari Subramoni'
Cc: mvapich-discuss at cse.ohio-state.edu; 'Raghu Reddy'
Subject: RE: [mvapich-discuss] MVAPICH2-GDR/2.2-2 problems on RHEL 7.3 system
Hi Hari,
We are using the open source OFED included with RHEL 7.3.
Thanks,
Raghu
From: hari.subramoni at gmail.com [mailto:hari.subramoni at gmail.com] On Behalf Of Hari Subramoni
Sent: Thursday, April 20, 2017 5:22 PM
To: Raghu Reddy <raghu.reddy at noaa.gov>
Cc: mvapich-discuss at cse.ohio-state.edu
Subject: Re: [mvapich-discuss] MVAPICH2-GDR/2.2-2 problems on RHEL 7.3 system
Hi Dr. Reddy,
It looks like the version of OFED you have installed on the system does not support XRC. Could you please let us know what version OFED you've installed on the system? Is it MOFED 3.2?
Best Regards,
Hari.
On Thu, Apr 20, 2017 at 4:46 PM, Raghu Reddy <raghu.reddy at noaa.gov> wrote:
Hi all,
We are in the process of upgrading our system to RHEL 7.3, and are having problems trying to compile a simple program with the mvapich2-gdr/2.2-2 library.
Our production system is running the following OS:
- Red Hat Enterprise Linux Server release 6.8 (Santiago)
And we’ve been running mvapich2-gdr/2.2.1 on that system for some time now.
However, now we’re in the process of upgrading to RHEL 7.3 and have that installed on a smaller test system:
System information for the test system:
- Each node consists of 2 Haswell processors with 10 cores each.
- Each node has 8 Tesla P100 (Pascal) GPUs.
- The interconnect is using 1 Mellanox Connect-X 3 IB card connected to socket 1.
- Red Hat Enterprise Linux Server release 7.3 (Maipo)
Initially I tried to use the existing mvapich2-gdr/2.2.1 and that one failed to compile a simple program. So I went ahead and downloaded the from RHEL/CENTOS 7 secction of the mvapich2-gdr 2.2 library. Since there was only one option available that was suitable for us with the following combination:
- Intel 16.0.2 without SLURM
- CUDA 8.0
I downloaded the version from the MLNX-OFED 3.2 row in that table.
The error I am getting when trying to compile it is the following:
sg001% module purge
sg001% module load intel/16.1.150 cuda/8.0 mvapich2-gdr/2.2-2-cuda-8.0-intel
sg001%
sg001% mpicc -g -o osu_bibw osu_bibw.c
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_modify_xrc_rcv_qp at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_unreg_xrc_rcv_qp at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_open_xrc_domain at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_create_xrc_srq at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_close_xrc_domain at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_reg_xrc_rcv_qp at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_create_xrc_rcv_qp at IBVERBS_1.1'
sg001%
- One question is why is this specifically trying to link to ibverbs_1.1?
- Are we missing some configuration step?
- Are we missing any of the RPMs?
Any advice his very much appreciated!
I’m including the output from the verbose option below.
Thanks,
Raghu
=================== verbose output ==================
mpicc for MVAPICH2 version 2.2
icc version 16.0.1 (gcc version 4.8.5 compatibility)
/apps/intel/compilers_and_libraries_2016.1.150/linux/bin/intel64/mcpcom --target_efi2 --lang=c -_g -mP3OPT_inline_alloca -D__ICC=1600 -D__
INTEL_COMPILER=1600 -D__INTEL_COMPILER_UPDATE=1 -D__PTRDIFF_TYPE__=long "-D__SIZE_TYPE__=unsigned long" -D__WCHAR_TYPE__=int "-D__WINT_TYPE__
=unsigned int" "-D__INTMAX_TYPE__=long int" "-D__UINTMAX_TYPE__=long unsigned int" -D__LONG_MAX__=9223372036854775807L -D__QMSPP_ -D__OPTIMIZ
E__ -D__NO_MATH_INLINES -D__NO_STRING_INLINES -D__GNUC_GNU_INLINE__ -D__GNUC__=4 -D__GNUC_MINOR__=8 -D__GNUC_PATCHLEVEL__=5 -D__LP64__ -D_LP6
4 -D__GXX_ABI_VERSION=1002 "-D__USER_LABEL_PREFIX__= " -D__REGISTER_PREFIX__= -D__INTEL_RTTI__ -D__EXCEPTIONS=1 -D__unix__ -D__unix -D__linux
__ -D__linux -D__gnu_linux__ -B -Dunix -Dlinux "-_Asystem(unix)" -D__ELF__ -D__x86_64 -D__x86_64__ -D__amd64 -D__amd64__ "-_Acpu(x86_64)" "-_
Amachine(x86_64)" -D__INTEL_COMPILER_BUILD_DATE=20151021 -D__INTEL_OFFLOAD -D__i686 -D__i686__ -D__pentiumpro -D__pentiumpro__ -D__pentium4 -
D__pentium4__ -D__tune_pentium4__ -D__SSE2__ -D__SSE2_MATH__ -D__SSE__ -D__SSE_MATH__ -D__MMX__ -_k -_8 -_l --has_new_stdarg_support -_a -_b
--gnu_version=40805 -_W5 --gcc-extern-inline --c_exceptions --multibyte_chars -mGLOB_diag_suppress_sys -I/usr/local/cuda-8.0/include -I/apps/
mvapich2-gdr-cuda8.0-intel/2.2-2/include --array_section --simd --simd_func --offload_mode=1 --offload_target_names=gfx,GFX,mic,MIC --offload
_unique_string=icc010330396754DpBdBk -D_FORTIFY_SOURCE=2 -mGLOB_em64t=TRUE -mP1OPT_version=16.0-intel64 -mGLOB_diag_enable_disable=E:level1 -
mGLOB_diag_file=/tmp/iccAPK0ST.diag -mP1OPT_print_version=FALSE -mCG_use_gas_got_workaround=F -mP2OPT_align_option_used=TRUE -mGLOB_gcc_versi
on=485 "-mGLOB_options_string=-I/usr/local/cuda-8.0/include -I/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/include -O2 -g -pipe -Wall -Wp,-D_FORTIF
Y_SOURCE=2 -fexceptions -fstack-protector-strong --param=ssp-buffer-size=4 -grecord-gcc-switches -m64 -mtune=generic -v -g -o osu_bibw -L/app
s/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64 -Wl,-rpath -Wl,/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64 -Wl,--enable-new-dtags -lmpi" -mGLOB_cxx
_limited_range=FALSE -mCG_extend_parms=FALSE -mGLOB_compiler_bin_directory=/apps/intel/compilers_and_libraries_2016.1.150/linux/bin/intel64 -
mP3OPT_emit_line_numbers -mGLOB_debug_target=GLOB_DEBUG_TARGET_ALL -mDEBUG_record_switches -mDEBUG_info_level=2 -mDEBUG_use_indirect_strings=
TRUE -mIPOPT_ninl_debug_info=TRUE -mDEBUG_emit_dwarf_inline_info=TRUE -mDEBUG_debug_ranges=TRUE -mGLOB_debug_format=GLOB_DEBUG_FORMAT_DWARF30
-mGLOB_as_output_backup_file_name=/tmp/iccyBC07bas_.s -mGLOB_dashboard_use_source_name -mIPOPT_activate -mIPOPT_lite -mGLOB_instruction_tuni
ng=0x0 -mGLOB_product_id_code=0x22006d91 -mCG_bnl_movbe=T -mGLOB_extended_instructions=0x8 -mP3OPT_use_mspp_call_convention -mP2OPT_subs_out_
of_bound=FALSE -mP2OPT_disam_type_based_disam=2 -mP2OPT_disam_assume_ansi_c -mP2OPT_checked_disam_ansi_alias=TRUE -mGLOB_ansi_alias -mPGOPTI_
value_profile_use=T -mGLOB_opt_report_use_source_name -mCG_stack_security_check=0x75 -mP2OPT_il0_array_sections=TRUE -mGLOB_offload_mode=1 -m
P2OPT_offload_unique_var_string=icc010330396754DpBdBk -mGLOB_opt_level=2 -mP2OPT_hlo_level=2 -mP2OPT_hlo -mP2OPT_hpo_rtt_control=0 -mIPOPT_ar
gs_in_regs=0 -mP2OPT_disam_assume_nonstd_intent_in=FALSE -mGLOB_imf_mapping_library=/apps/intel/compilers_and_libraries_2016.1.150/linux/bin/
intel64/libiml_attr.so -mIPOPT_single_file_compile_and_link=TRUE -mP2OPT_hlo_embed_loopinfo -mPGOPTI_gen_threadsafe_level=0 -mIPOPT_lto_objec
t_enabled -mIPOPT_lto_object_value=1 -mIPOPT_obj_output_file_name=/tmp/iccAPK0ST.o -mIPOPT_whole_archive_fixup_file_name=/tmp/iccwarch9YWwnu
-mGLOB_linker_version=2.25.1 -mGLOB_long_size_64 -mGLOB_routine_pointer_size_64 -mGLOB_driver_tempfile_name=/tmp/icctempfileLHGnvD -mP3OPT_as
m_target=P3OPT_ASM_TARGET_GAS -mGLOB_async_unwind_tables=TRUE -mGLOB_obj_output_file=/tmp/iccAPK0ST.o -mGLOB_source_dialect=GLOB_SOURCE_DIALE
CT_C -mP1OPT_source_file_name=osu_bibw.c -mGLOB_eh_c_linux osu_bibw.c
#include "..." search starts here:
#include <...> search starts here:
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/include
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/include
/apps/cuda/cuda-8.0/include
/apps/intel/compilers_and_libraries_2016.1.150/linux/ipp/include
/apps/intel/compilers_and_libraries_2016.1.150/linux/tbb/include
/apps/intel/compilers_and_libraries_2016.1.150/linux/compiler/include/intel64
/apps/intel/compilers_and_libraries_2016.1.150/linux/compiler/include
/usr/local/include
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/include
/usr/include/
/usr/include
End of search list.
ld /usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../../lib64/crt1.o /usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../../lib64/crti.o /usr/li
b/gcc/x86_64-redhat-linux/4.8.5/crtbegin.o --eh-frame-hdr --build-id -dynamic-linker /lib64/ld-linux-x86-64.so.2 -m elf_x86_64 -L/apps/mvapic
h2-gdr-cuda8.0-intel/2.2-2/lib64 -o osu_bibw -L/apps/intel/compilers_and_libraries_2016.1.150/linux/compiler/lib/intel64 -L/apps/intel/compil
ers_and_libraries_2016.1.150/linux/ipp/intel64 -L/apps/intel/compilers_and_libraries_2016.1.150/linux/mkl/lib/intel64 -L/apps/intel/compilers
_and_libraries_2016.1.150/linux/tbb/lib/intel64/gcc4.4 -L/apps/intel/compilers_and_libraries_2016.1.150/linux/compiler/lib/intel64_lin -L/usr
/lib/gcc/x86_64-redhat-linux/4.8.5/ -L/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../../lib64 -L/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../..
/../../lib64/ -L/lib/../lib64 -L/lib/../lib64/ -L/usr/lib/../lib64 -L/usr/lib/../lib64/ -L/apps/intel/compilers_and_libraries_2016.1.150/linu
x/compiler/lib/intel64/ -L/apps/intel/compilers_and_libraries_2016.1.150/linux/mkl/lib/intel64/ -L/apps/intel/compilers_and_libraries_2016.1.
150/linux/tbb/lib/intel64/gcc4.4/ -L/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../ -L/lib64 -L/lib/ -L/usr/lib64 -L/usr/lib /tmp/iccAPK0ST.
o -rpath /apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64 --enable-new-dtags -lmpi -Bdynamic -Bstatic -limf -lsvml -lirng -Bdynamic -lm -Bstatic
-lipgo -ldecimal --as-needed -Bdynamic -lcilkrts -lstdc++ --no-as-needed -lgcc -lgcc_s -Bstatic -lirc -lsvml -Bdynamic -lc -lgcc -lgcc_s -Bst
atic -lirc_s -Bdynamic -ldl -lc /usr/lib/gcc/x86_64-redhat-linux/4.8.5/crtend.o /usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../../lib64/crtn
.o
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_modify_xrc_rcv_qp at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_unreg_xrc_rcv_qp at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_open_xrc_domain at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_create_xrc_srq at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_close_xrc_domain at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_reg_xrc_rcv_qp at IBVERBS_1.1'
/apps/mvapich2-gdr-cuda8.0-intel/2.2-2/lib64/libmpi.so: undefined reference to `ibv_create_xrc_rcv_qp at IBVERBS_1.1'
sg001%
_______________________________________________
mvapich-discuss mailing list
mvapich-discuss at cse.ohio-state.edu
http://mailman.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20170421/a8a1bac8/attachment-0001.html>
More information about the mvapich-discuss
mailing list