[Mvapich-discuss] Failed to unpack MVAPICH-Plus RPM

You, Zhi-Qiang zyou at osc.edu
Sat Jan 11 21:32:17 EST 2025


Hi DK,

Thank you for the prompt fix. The RPM is now functioning correctly. However, I encountered the following error while running a simple ping-pong MPI test over two nodes:

slurmstepd: error: pmijobid missing in fullinit command

I suspected this might be due to PMI incompatibility. I referred to this documentation<https://mvapich-docs.readthedocs.io/en/latest/cvar.html#mvapich-environment-variables> and learned about setting MVP_PMI_VERSION to 2 to align with our SLURM configuration. However, the issue persists. I also checked the output of mpichversion -a and confirmed that the --with-pmi=pmi2 option is enabled, leading me to conclude that this is not a PMI compatibility issue.

Additionally, I have a few related questions:

  1.  Will there be an MVAPICH 4.0 release, or will it be replaced by the MVAPICH-Plus CPU-only version?
  2.  The documentation linked above lists many environment variables that I haven’t encountered before when using MVAPICH2-GDR. Are these new variables specific to MVAPICH 4.0? Are variables like MV2_USE_CUDA/MVP_USE_CUDA still available, or should they be replaced with MVP_ENABLE_GPU?
  3.  Could you help confirm if the following variables are still supported in MVAPICH?
     *   MVP_USE_RDMA_CM
     *   MVP_HOMOGENEOUS_CLUSTER
     *   MVP_IBA_HCA

Thank you for your time and assistance!

Best regards,
ZQ


From: Panda, Dhabaleswar <panda at cse.ohio-state.edu>
Date: Saturday, January 11, 2025 at 3:14 AM
To: You, Zhi-Qiang <zyou at osc.edu>, Announcement about MVAPICH (MPI over InfiniBand, RoCE, Omni-Path, Slingshot, iWARP and EFA) Libraries developed at NBCL/OSU <mvapich-discuss at lists.osu.edu>
Subject: RE: Failed to unpack MVAPICH-Plus RPM
Hi ZQ,

As we have communicated with you separately, a new RPM has been uploaded. Please try this version and let us know whether you see any additional issues.

DK

From: Mvapich-discuss <mvapich-discuss-bounces at lists.osu.edu> On Behalf Of You, Zhi-Qiang via Mvapich-discuss
Sent: Thursday, January 2, 2025 1:54 PM
To: mvapich-discuss at lists.osu.edu
Subject: [Mvapich-discuss] Failed to unpack MVAPICH-Plus RPM

Hello,

I downloaded the MVAPICH-Plus 4.0 RPM from the following link:
https://mvapich.cse.ohio-state.edu/download/mvapich/plus/4.0/cuda/UCX/mofed5.0/mvapich-plus-4.0-cuda12.4.rhel9.ofed24.10.ucx.gcc13.2.0.slurm-4.0-1.x86_64.rpm, but I encountered an issue when trying to unpack it using cpio. The process failed with the error:

cpio: premature end of file

I have no issues unpacking other RPMs, so it seems this file might be corrupted. Could you please check and confirm?

Thank you,
ZQ

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osu.edu/pipermail/mvapich-discuss/attachments/20250112/0286381a/attachment-0002.html>


More information about the Mvapich-discuss mailing list