[mvapich-discuss] MVAPICH2-2.3.3 giving me floating point error (signal 8)
Shaleen Garg
shaleen.garg at rutgers.edu
Mon Jun 22 09:47:56 EDT 2020
Hi All,
I am trying to install MVAPICH2 on a machine with a Mellanox InfiniBand adapter:
$ lspci | grep "Mellanox"
Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]
To install, I followed the user guide. Since this is a new machine, I first installed the following packages (on Ubuntu 18.04 with Linux kernel 4.15.0-55-generic): libibmad-dev libibumad-dev libibumad3 libibverbs-dev gfortran infiniband-diags rdma-core.
Installation Method:
$ ./configure --with-device=ch3:mrail --with-rdma=gen2
$ make -j
$ sudo make install
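A quick sanity check after the install step is to confirm that the expected tools are reachable through PATH. The sketch below is only an illustration: the tool names are assumptions about this setup (mpicc, mpirun and mpiname from the MVAPICH2 install, ibstat from infiniband-diags), and report_tool is a hypothetical helper, not part of MVAPICH2.

```shell
#!/bin/sh
# Hypothetical helper: report whether a command is reachable through PATH.
report_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1 -> $(command -v "$1")"
    else
        echo "missing: $1"
    fi
}

# Assumed tool names: mpicc, mpirun and mpiname come from the MVAPICH2 install,
# ibstat comes from the infiniband-diags package.
for tool in mpicc mpirun mpiname ibstat; do
    report_tool "$tool"
done
```

If mpirun or mpicc resolves to a different MPI installation than the one just built, the test program may be compiled or launched by the wrong runtime, so the printed paths are worth a look.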
The installation completes without errors. But when I run a hello world program:
$ mpirun -env MV2_SMP_USE_CMA=0 -np 10 ./a.out
I get the following error:
[apt140:mpi_rank_2][error_sighandler] Caught error: Floating point exception (signal 8)
…
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= PID 13854 RUNNING AT apt140
= EXIT CODE: 8
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
Is there something I am missing? I don’t know why an MPI hello world fails even within a single node. The code I am testing comes from https://mpitutorial.com/tutorials/mpi-hello-world/
Regards,
Shaleen Garg