[mvapich-discuss] mvapich-discuss Digest, Vol 176, Issue 6

Ghazimirsaeed, Seyedeh Mahdieh ghazimirsaeed.3 at osu.edu
Fri Aug 21 15:49:56 EDT 2020


Hi Grover,

We installed MVAPICH on our local cluster with the same configuration flags you mentioned, but we couldn't reproduce the failure with WRF 3.6.

Can you please configure MVAPICH with the "--disable-mcast" flag and see if that fixes the failure?
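
For reference, a rebuild along those lines might look like the sketch below. The flags are the ones from your report (with the repeated --enable-threads=multiple listed once) plus --disable-mcast; the default prefix is a hypothetical path for illustration, and the command is only printed so it can be checked before it is run.

```shell
# Flags copied from the original report; --disable-mcast is the only addition.
# The default prefix below is a hypothetical path, used only if $prefix is unset.
prefix="${prefix:-$HOME/opt/mvapich2-2.3.4-nomcast}"

configure_cmd="./configure --prefix=$prefix --with-device=ch3:mrail --with-rdma=gen2 \
--enable-threads=multiple --enable-rdma-cm --enable-romio --with-ch3-rank-bits=32 \
--with-ib-include=/usr/include --with-ib-libpath=/usr/lib64 \
--with-ibverbs-include=/usr/include --with-ibverbs-lib=/usr/lib64 \
--disable-mcast \
CC=icc CFLAGS=-fPIC F77=ifort FFLAGS=-fPIC FC=ifort FCFLAGS=-fPIC CXX=icpc CXXFLAGS=-fPIC"

# Print the command for review; run it from the MVAPICH2 source tree,
# followed by make and make install, once it looks right.
echo "$configure_cmd"
```

WRF would then need to be relaunched (and, if it links MPI statically, rebuilt) against the new installation.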

Thanks,
Mahdieh

On 8/21/20, 12:00 PM, "mvapich-discuss on behalf of mvapich-discuss-request at cse.ohio-state.edu" <mvapich-discuss-bounces at cse.ohio-state.edu on behalf of mvapich-discuss-request at cse.ohio-state.edu> wrote:

    Send mvapich-discuss mailing list submissions to
    	mvapich-discuss at cse.ohio-state.edu
    
    To subscribe or unsubscribe via the World Wide Web, visit
    	http://mailman.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
    or, via email, send a message with subject or body 'help' to
    	mvapich-discuss-request at cse.ohio-state.edu
    
    You can reach the person managing the list at
    	mvapich-discuss-owner at cse.ohio-state.edu
    
    When replying, please edit your Subject line so it is more specific
    than "Re: Contents of mvapich-discuss digest..."
    
    
    Today's Topics:
    
       1. mvapich2.3.4 mcast error (Gong-Do Hwang)
    
    
    ----------------------------------------------------------------------
    
    Message: 1
    Date: Thu, 20 Aug 2020 17:25:04 +0800
    From: Gong-Do Hwang <grover.hwang at gmail.com>
    To: mvapich-discuss at cse.ohio-state.edu
    Subject: [mvapich-discuss] mvapich2.3.4 mcast error
    Message-ID:
    	<CAFF9WUs=cJ87BM7EbXGkdZNvx5XHLQY4uxK3gERxiJkezzTaoQ at mail.gmail.com>
    Content-Type: text/plain; charset="utf-8"
    
    Hi,
    
    I was using mvapich2.3.4 to run WRF 4.1.5 over MLNX OFED 5.0. I found
    that when using more than 8 nodes I got the error messages below when
    the WRF integration started:
    
    [cn01:mpi_rank_0][mv2_mcast_resend_window] Failed to post mcast send errno:12
    [cn01:mpi_rank_0][mv2_mcast_resend_window] Failed to post mcast send errno:12
    [cn01:mpi_rank_0][mv2_mcast_resend_window] Failed to post mcast send errno:12
    
    mvapich2 was configured with the following flags:
    ./configure --prefix=$prefix --with-device=ch3:mrail --with-rdma=gen2 \
        --enable-threads=multiple --enable-rdma-cm --enable-threads=multiple \
        --enable-romio --with-ch3-rank-bits=32 --with-ib-include=/usr/include \
        --with-ib-libpath=/usr/lib64 --with-ibverbs-include=/usr/include \
        --with-ibverbs-lib=/usr/lib64 CC=icc CFLAGS="-fPIC" F77=ifort \
        FFLAGS="-fPIC" FC=ifort FCFLAGS="-fPIC" CXX=icpc CXXFLAGS="-fPIC"
    
    The run script and arguments were:
    
    export MV2_ENABLE_AFFINITY=0
    export MV2_IBA_HCA=mlx5_0
    mpiexec -rmk pbs -np ${NPROCS} ./wrf.exe
    
    I set the HCA explicitly because the compute nodes also have another
    mlx Ethernet HCA.
    
    Is there anywhere I can find out what errno 12 means? And is there a
    workaround for this?
    Thanks so much for your help!
    
    Grover
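
On the errno question in the message above: on Linux, errno 12 is ENOMEM ("Cannot allocate memory"). Why the multicast send post returns it is not established in this thread. A minimal sketch for looking up any errno value from the shell, assuming python3 is available on the node (errno_text is a hypothetical helper name, not part of MVAPICH2):

```shell
# errno_text is a hypothetical helper; it asks the C library's strerror()
# (via Python's os.strerror) for the message that belongs to an errno value.
errno_text() {
    python3 -c 'import os, sys; print(os.strerror(int(sys.argv[1])))' "$1"
}

errno_text 12   # on Linux this prints: Cannot allocate memory
```

The symbolic names are also listed in the errno(3) man page on most Linux systems.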
    
    



