<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
Hi Tony,</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)" class="elementToProof">
Thank you for bringing this issue to our attention. I've created a ticket in our system to track it. Our team is looking into it, and we will follow up as soon as we have an update!</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)" class="elementToProof">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)" class="elementToProof">
Thank you,</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)" class="elementToProof">
Brian Seeds<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)" class="elementToProof">
<br>
</div>
<div id="appendonsend"></div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font style="font-size:11pt" face="Calibri, sans-serif" color="#000000"><b>From:</b> Mvapich-discuss &lt;mvapich-discuss-bounces@lists.osu.edu&gt; on behalf of Tony Niro via Mvapich-discuss &lt;mvapich-discuss@lists.osu.edu&gt;<br>
<b>Sent:</b> Wednesday, May 11, 2022 9:41 AM<br>
<b>To:</b> mvapich-discuss@lists.osu.edu &lt;mvapich-discuss@lists.osu.edu&gt;<br>
<b>Subject:</b> [Mvapich-discuss] Issue with GPCNET in UD mode with mvapich2-2.3.7 + patch for Rockport</font>
<div> </div>
</div>
<div>
<div class="x_WordSection1">
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
Hi all,</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
We have built the GPCNET 1.2 application against mvapich2-2.3.7 (+ patch for Rockport). Frequently, when we run the network_load_test with MV2_USE_UD_ONLY=1, the application hangs. When we try to get back-traces of all the running processes, the test resumes
and completes. We get the back-traces by calling <b>gdb -batch -ex "attach $pid" -ex "bt" -ex "detach"
</b>on every network_load_test process on every server.</p>
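<p class="x_MsoNormal" style="margin: 0in; font-size: 11pt; font-family: Calibri, sans-serif;">
A minimal sketch of how the per-process back-trace collection described above might be scripted on one server. The function name dump_backtraces and the GDB override variable are illustrative assumptions, not part of MVAPICH2 or GPCNET. It attaches with gdb's -p option and uses "thread apply all bt" so that every thread is shown, which differs slightly from the command quoted above.</p>
<pre style="font-family:'Courier New'">
```shell
#!/bin/sh
# Sketch: dump a backtrace of every thread for each PID given as an argument.
# GDB may be overridden (for example GDB=echo) to preview the commands
# without attaching to anything.
GDB="${GDB:-gdb}"

dump_backtraces() {
    for pid in "$@"; do
        echo "=== pid $pid ==="
        # -batch exits once the commands have run; -p attaches to the
        # running process, and gdb detaches from it again when it exits.
        "$GDB" -batch -p "$pid" -ex "thread apply all bt"
    done
}
```
</pre>
<p class="x_MsoNormal" style="margin: 0in; font-size: 11pt; font-family: Calibri, sans-serif;">
For example, dump_backtraces 12345 12346 prints one section per PID (the PIDs here are placeholders). Setting GDB=echo first previews the gdb invocations without touching the processes.</p>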
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
We are looking for guidance on how to debug this problem. For example, are there any other options that we should be setting?</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
Note that the application runs with no issues if we use MV2_USE_UD_HYBRID=0 instead of MV2_USE_UD_ONLY=1. Also note that the results of the run that hung seem reasonable. I’ve included output for the hang scenario as well as one where the application ran normally.</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
Tony Niro</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
Example of run that hung/resumed:</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
________________________________________________</p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">user@dell-s13-h1[11:41:04] ~>/usr/bin/time --format="%e seconds" /opt/bm/hpc/mvapich2-2.3.7-wc-patched-ng/bin/mpiexec -np 1504 -f /home/user/mpi-host.cfg -env MV2_HOMOGENEOUS_CLUSTER=1 -env MV2_NDREG_ENTRIES_MAX=100000
-env MV2_NDREG_ENTRIES=50000 -env MV2_IBA_HCA=mlx5_0 -env MV2_SHMEM_COLL_NUM_COMM=64 -env MV2_UD_ZCOPY_NUM_RETRY=1000000 -env MV2_NUM_QP_PER_PORT=1 -env MV2_USE_UD_ONLY=1 /opt/bm/hpc/mvapich-GPCNET-1.2/network_load_test_tn</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">[dell-s13-h21:mpi_rank_0][rdma_get_user_parameters] Cannot have more than one QP with UD_ONLY / Hybrid mode.</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">[dell-s13-h21:mpi_rank_0][rdma_get_user_parameters] Resetting MV2_NUM_QP_PER_PORT to 1.</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">NetworkLoad Tests v1.2</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> Test with 1504 MPI ranks (47 nodes)</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> 10 nodes running Network Tests</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> 37 nodes running Congestion Tests (min 9 nodes per congestor)</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> Legend</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> RR = random ring communication pattern</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> Lat = latency</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> BW = bandwidth</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> BW+Sync = bandwidth with barrier</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+------------------------------------------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Isolated Network Tests |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Name | Avg | 99% | Units |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided Lat (8 B) | 3.1 | 6.8 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided BW+Sync (131072 B) | 282.1 | 189.9 | MiB/s/rank |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Multiple Allreduce (8 B) | 18.2 | 29.3 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+------------------------------------------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Network Tests running with Congestion Tests |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Name | Avg | 99% | Units |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided Lat (8 B) | 3.7 | 11.5 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-size:10.5pt; font-family:"Courier New"; color:#172B4D; background:#F4F5F7">&lt;HANG&gt;
</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-size:10.5pt; font-family:"Courier New"; color:#172B4D; background:#F4F5F7">&lt;attach debugger to all processes to get stack trace&gt;</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided BW+Sync (131072 B) | 201.7 | 130.8 | MiB/s/rank |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Multiple Allreduce (8 B) | 21.0 | 38.1 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+------------------------------------------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Network Tests running with Congestion Tests - Key Results |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Name | Congestion Impact Factor |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| | Avg | 99% |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided Lat (8 B) | 1.2X | 1.7X |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided BW+Sync (131072 B) | 1.4X | 1.5X |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Multiple Allreduce (8 B) | 1.2X | 1.3X |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">4851.38 seconds</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">user@dell-s13-h1[13:07:00] ~></span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">Example of successful run</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">user@dell-s13-h1[11:20:50] ~>/usr/bin/time --format="%e seconds" /opt/bm/hpc/mvapich2-2.3.7-wc-patched-ng/bin/mpiexec -np 1504 -f /home/user/mpi-host.cfg -env MV2_HOMOGENEOUS_CLUSTER=1 -env MV2_NDREG_ENTRIES_MAX=100000
-env MV2_NDREG_ENTRIES=50000 -env MV2_IBA_HCA=mlx5_0 -env MV2_SHMEM_COLL_NUM_COMM=64 -env MV2_UD_ZCOPY_NUM_RETRY=1000000 -env MV2_NUM_QP_PER_PORT=1 -env MV2_USE_UD_ONLY=1 /opt/bm/hpc/mvapich-GPCNET-1.2/network_load_test_tn</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">[dell-s13-h21:mpi_rank_0][rdma_get_user_parameters] Cannot have more than one QP with UD_ONLY / Hybrid mode.</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">[dell-s13-h21:mpi_rank_0][rdma_get_user_parameters] Resetting MV2_NUM_QP_PER_PORT to 1.</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">NetworkLoad Tests v1.2</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> Test with 1504 MPI ranks (47 nodes)</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> 10 nodes running Network Tests</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> 37 nodes running Congestion Tests (min 9 nodes per congestor)</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> Legend</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> RR = random ring communication pattern</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> Lat = latency</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> BW = bandwidth</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> BW+Sync = bandwidth with barrier</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+------------------------------------------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Isolated Network Tests |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Name | Avg | 99% | Units |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided Lat (8 B) | 3.0 | 6.8 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided BW+Sync (131072 B) | 270.8 | 182.0 | MiB/s/rank |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Multiple Allreduce (8 B) | 19.0 | 29.8 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+------------------------------------------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Network Tests running with Congestion Tests |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Name | Avg | 99% | Units |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided Lat (8 B) | 3.7 | 11.9 | usec |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| RR Two-sided BW+Sync (131072 B) | 188.0 | 122.5 | MiB/s/rank |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
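<span style="font-family:"Courier New"">|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Multiple Allreduce (8 B)&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;21.6&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;39.5&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;usec&nbsp;|</span></p>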
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------+--------------+--------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+------------------------------------------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
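<span style="font-family:"Courier New"">|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Network Tests running with Congestion Tests - Key Results&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;|</span></p>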
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+--------------------------------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| Name | Congestion Impact Factor |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">| | Avg | 99% |</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
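<span style="font-family:"Courier New"">|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;RR Two-sided Lat (8 B)&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1.2X&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1.8X&nbsp;|</span></p>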
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
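<span style="font-family:"Courier New"">|&nbsp;RR Two-sided BW+Sync (131072 B)&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1.4X&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1.5X&nbsp;|</span></p>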
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
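<span style="font-family:"Courier New"">|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Multiple Allreduce (8 B)&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1.1X&nbsp;|&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1.3X&nbsp;|</span></p>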
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">+---------------------------------+----------------------+---------------------+</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New"">204.81 seconds</span></p>
<p class="x_MsoNormal" style="margin-top: 0px; margin-bottom: 0px;margin: 0in; font-size: 11pt; font-family: "Calibri", sans-serif;">
<span style="font-family:"Courier New""> </span></p>
</div>
</div>
</body>
</html>