[mvapich-discuss] cudaHostUnegister Failed: Invalid argument (22)

John Donners john.donners at surfsara.nl
Fri Nov 27 05:13:55 EST 2015


Dear developers,

I downloaded and installed the mvapich2-gdr-2.1 RPM.
I'm getting an error in MPI_Finalize when running an application of one 
of our users:

[gcn1:mpi_rank_0][ibv_cuda_unregister]
src/mpid/ch3/channels/mrail/src/gen2/ibv_cuda_util.c:1543:
cudaHostUnegister Failed: Invalid argument (22)

and a backtrace of the resulting core dump gives:

(gdb) bt
#0  0x00002b58cf546eba in ?? () from /usr/lib64/libcuda.so.1
#1  0x00002b58cf555954 in ?? () from /usr/lib64/libcuda.so.1
#2  0x00002b58cf4c9b1f in ?? () from /usr/lib64/libcuda.so.1
#3  0x00002b58cf48ffb2 in cuStreamDestroy_v2 () from /usr/lib64/libcuda.so.1
#4  0x00002b58ccd2e4f0 in ?? () from 
/hpc/sw/cuda/7.0.28//lib64/libcudart.so.7.0
#5  0x00002b58ccd6332d in cudaStreamDestroy () from 
/hpc/sw/cuda/7.0.28//lib64/libcudart.so.7.0
#6  0x00002b58cd54bd31 in deallocate_cuda_rndv_streams () from 
/hpc/sw/mvapich2-gdr-2.1-cuda70-intel/lib64/libmpi.so.12
#7  0x00002b58cd548677 in cuda_cleanup () from 
/hpc/sw/mvapich2-gdr-2.1-cuda70-intel/lib64/libmpi.so.12
#8  0x00002b58cd4cd3c7 in MPID_Finalize () from 
/hpc/sw/mvapich2-gdr-2.1-cuda70-intel/lib64/libmpi.so.12
#9  0x00002b58cd426be2 in PMPI_Finalize () from 
/hpc/sw/mvapich2-gdr-2.1-cuda70-intel/lib64/libmpi.so.12
#10 0x00000000004093f9 in main (argc=1, argv=0x7fff67704158) at lbe.c:1301

It looks at first sight an issue with the memory cleanup in mvapich2. 
Let me know if I can help with the further investigation of this issue.

With regards,
John

HPC Center SURFsara, Amsterdam


More information about the mvapich-discuss mailing list