[Mvapich-discuss] [MVAPICH-PLUS] RPM Request and Clarification

Panda, Dhabaleswar panda at cse.ohio-state.edu
Tue Dec 10 07:36:08 EST 2024


Hi,

Thanks for your note. We will be sending you the requested RPM shortly.

Please note that MVAPICH2-X and MVAPICH2-GDR are no longer under active development. You can use the latest MVAPICH-Plus 4.0 release instead. This version is optimized for fast communication using both CPU and GPU buffers. It supports all the different interconnects (including InfiniBand, RoCE, Slingshot, and Omni-Path), GPUs (NVIDIA, AMD, and Intel), and CPUs (Arm and x86). We strongly encourage you to move to the MVAPICH-Plus series.

Thanks,

DK Panda


________________________________________
From: Mvapich-discuss <mvapich-discuss-bounces+panda.2=osu.edu at lists.osu.edu> on behalf of Le Viet Duc via Mvapich-discuss <mvapich-discuss at lists.osu.edu>
Sent: Monday, December 9, 2024 2:57 AM
To: mvapich-discuss at lists.osu.edu
Cc: 권민우; 안도식; KISTI User Support
Subject: [Mvapich-discuss] [MVAPICH-PLUS] RPM Request and Clarification

Dear Prof. Panda and MVAPICH Group,

I hope this email finds you well.
With the recent release of MVAPICH-PLUS providing optimized collectives for Hopper GPUs, KISTI Supercomputing Center (KR) is planning to run the OSU Micro-Benchmarks (OMB) on H200 GPUs.

Since all the provided RPM files are based on RHEL 8/9, we submitted an RPM request through the online submission form last week (Dec 5th).
Please allow me to submit the request again in case the GitLab mailing agent failed to deliver the first one.


  * Organization: KISTI (S. Korea)
  * MVAPICH Version: MVAPICH-PLUS 4
  * CPU Arch: x86
  * GPU Arch: Hopper H200 (sm_90)
  * OS: CentOS 7.9.2009
  * MOFED: 23.10-0.5.5.0
  * Compiler: GCC 12.2.0
  * Launcher: Slurm
  * Runtime: CUDA
  * Runtime Version: CUDA 12.3
  * Other: Slurm 24.05.3

Could you also clarify whether my understanding below is correct?

  * MVAPICH2-X: optimized for CPUs, with SHARP support on CPU buffers.
  * MVAPICH2-GDR: optimized for heterogeneous systems, without SHARP support on GPU buffers.
  * MVAPICH-PLUS: X + GDR, with SHARP support on GPU buffers.
     * In other words, is the PLUS version equivalent to NVIDIA's NCCL?
     * Does the PLUS version also support all_reduce through NVLink 4 switches?

Please pardon my confusion over the current product segmentation.

Regards,
Viet-Duc Le

