[mvapich-discuss] MVAPICH2-2.3.3 giving me floating point error (signal 8)

Shaleen Garg shaleen.garg at rutgers.edu
Mon Jun 22 09:47:56 EDT 2020


Hi All,

I am trying to install MVAPICH2 on a machine with a Mellanox InfiniBand adapter:


$ lspci | grep "Mellanox"

Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]

To install, I followed the user guide. Since this is a new machine, I first installed the following packages (on Ubuntu 18.04 with Linux kernel 4.15.0-55-generic): libibmad-dev libibumad-dev libibumad3 libibverbs-dev gfortran infiniband-diags rdma-core.

Installation Method:

$ ./configure --with-device=ch3:mrail --with-rdma=gen2

$ make -j

$ sudo make install
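
As a sanity check after the install, it seems worth confirming that the mpicc and mpirun found on PATH are the freshly built ones and not some other MPI. A minimal sketch, assuming the MVAPICH2 mpiname utility was installed alongside them (the tool_path variable is only illustrative):

```shell
# Show which MPI tools come first on PATH (or note that they are missing).
for tool in mpicc mpirun mpiname; do
    tool_path=$(command -v "$tool" 2>/dev/null) || tool_path="not found"
    printf '%-8s %s\n' "$tool" "$tool_path"
done

# mpiname ships with MVAPICH2; -a prints the version and configure options.
# Guarded so the snippet still runs if the tool is not on PATH.
if command -v mpiname >/dev/null 2>&1; then
    mpiname -a
fi
```

If the paths point at different installations, the compiler wrapper and the launcher may not match.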


This installs fine. However, when I run a hello world program:


$ mpirun -env MV2_SMP_USE_CMA=0 -np 10 ./a.out


I get the following error:

[apt140:mpi_rank_2][error_sighandler] Caught error: Floating point exception (signal 8)
…

===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   PID 13854 RUNNING AT apt140
=   EXIT CODE: 8
=   CLEANING UP REMAINING PROCESSES
=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
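
For what it's worth, signal 8 is SIGFPE, which on Linux is also raised by integer division by zero, so the crash may not involve floating-point arithmetic at all. A quick shell check of the number-to-name mapping (just a side check, not part of the MVAPICH2 tooling; sig_name is an illustrative variable):

```shell
# Translate signal number 8 into its name; kill -l prints it without the SIG prefix.
sig_name=$(kill -l 8)
echo "signal 8 = SIG${sig_name}"
```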

Is there something I am missing? I don't understand why an MPI hello world fails even when run within a single node. The code I am testing comes from https://mpitutorial.com/tutorials/mpi-hello-world/


Regards,
Shaleen Garg