[Mvapich-discuss] Failed to unpack MVAPICH-Plus RPM
You, Zhi-Qiang
zyou at osc.edu
Sat Jan 11 21:32:17 EST 2025
Hi DK,
Thank you for the prompt fix. The RPM is now functioning correctly. However, I encountered the following error while running a simple ping-pong MPI test over two nodes:
slurmstepd: error: pmijobid missing in fullinit command
I suspected this might be due to PMI incompatibility. I referred to this documentation (https://mvapich-docs.readthedocs.io/en/latest/cvar.html#mvapich-environment-variables) and learned about setting MVP_PMI_VERSION to 2 to align with our SLURM configuration. However, the issue persists. I also checked the output of mpichversion -a and confirmed that the --with-pmi=pmi2 option is enabled, leading me to conclude that this is not a PMI compatibility issue.
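For anyone repeating the check described above, here is a minimal Python sketch that pulls the --with-pmi value out of the text printed by mpichversion -a. The helper names (find_pmi_setting, read_version_text) and the sample string are hypothetical and only for illustration; the exact layout of the real output is not assumed, only that the configure options appear somewhere in it.

```python
import re
import subprocess


def find_pmi_setting(version_text):
    """Return the value given to --with-pmi=... in the text, or None if absent."""
    match = re.search(r"--with-pmi=([^\s'\"]+)", version_text)
    return match.group(1) if match else None


def read_version_text():
    """Run `mpichversion -a` and return its standard output as text.

    Assumes mpichversion is on PATH; not called automatically here.
    """
    result = subprocess.run(
        ["mpichversion", "-a"], capture_output=True, text=True, check=True
    )
    return result.stdout


# Hypothetical sample line, NOT real output from any installation.
sample = "configure: --prefix=/opt/example --with-pmi=pmi2 --with-pm=slurm"
pmi = find_pmi_setting(sample)
print("PMI setting found:", pmi)
```

On a node with the library loaded, find_pmi_setting(read_version_text()) would report the build-time PMI choice, which can then be compared by eye against the MVP_PMI_VERSION value exported in the job script.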
Additionally, I have a few related questions:
1. Will there be an MVAPICH 4.0 release, or will it be replaced by the MVAPICH-Plus CPU-only version?
2. The documentation linked above lists many environment variables that I haven’t encountered before when using MVAPICH2-GDR. Are these new variables specific to MVAPICH 4.0? Are variables like MV2_USE_CUDA/MVP_USE_CUDA still available, or should they be replaced with MVP_ENABLE_GPU?
3. Could you help confirm if the following variables are still supported in MVAPICH?
* MVP_USE_RDMA_CM
* MVP_HOMOGENEOUS_CLUSTER
* MVP_IBA_HCA
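While waiting for confirmation on which names are still honored, a small Python sketch like the one below can list the MV2_ and MVP_ variables actually set in a job environment, so old and new spellings can be reviewed side by side. The function name mvapich_vars is hypothetical, and the sketch makes no claim about which variables the library supports; it only groups whatever is set by prefix.

```python
import os


def mvapich_vars(env=None):
    """Group environment variables by the MV2_ and MVP_ prefixes.

    `env` defaults to the current process environment; pass a dict to
    inspect a captured environment instead.
    """
    env = os.environ if env is None else env
    grouped = {"MV2_": {}, "MVP_": {}}
    for name, value in env.items():
        for prefix in grouped:
            if name.startswith(prefix):
                grouped[prefix][name] = value
    return grouped


if __name__ == "__main__":
    # Print every matching variable, sorted by name within each prefix.
    for prefix, found in mvapich_vars().items():
        print(f"{prefix}* variables set: {len(found)}")
        for name in sorted(found):
            print(f"  {name}={found[name]}")
```

Running this inside the SLURM job step (rather than on the login node) shows what the MPI processes would actually inherit.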
Thank you for your time and assistance!
Best regards,
ZQ
From: Panda, Dhabaleswar <panda at cse.ohio-state.edu>
Date: Saturday, January 11, 2025 at 3:14 AM
To: You, Zhi-Qiang <zyou at osc.edu>, Announcement about MVAPICH (MPI over InfiniBand, RoCE, Omni-Path, Slingshot, iWARP and EFA) Libraries developed at NBCL/OSU <mvapich-discuss at lists.osu.edu>
Subject: RE: Failed to unpack MVAPICH-Plus RPM
Hi ZQ,
As we have communicated with you separately, a new RPM has been uploaded. Please try this version and let us know whether you see any additional issues.
DK
From: Mvapich-discuss <mvapich-discuss-bounces at lists.osu.edu> On Behalf Of You, Zhi-Qiang via Mvapich-discuss
Sent: Thursday, January 2, 2025 1:54 PM
To: mvapich-discuss at lists.osu.edu
Subject: [Mvapich-discuss] Failed to unpack MVAPICH-Plus RPM
Hello,
I downloaded the MVAPICH-Plus 4.0 RPM from the following link:
https://mvapich.cse.ohio-state.edu/download/mvapich/plus/4.0/cuda/UCX/mofed5.0/mvapich-plus-4.0-cuda12.4.rhel9.ofed24.10.ucx.gcc13.2.0.slurm-4.0-1.x86_64.rpm, but I encountered an issue when trying to unpack it using cpio. The process failed with the error:
cpio: premature end of file
I have no issues unpacking other RPMs, so it seems this file might be corrupted. Could you please check and confirm?
Thank you,
ZQ