[mvapich-discuss] Spawn support in MVAPICH2 v2.0
Jonathan Perkins
perkinjo at cse.ohio-state.edu
Thu Sep 4 07:36:42 EDT 2014
Hi Adam, I have a few questions.
Is osu_latency is working for you with this build?
Have you tried setting MV2_DEBUG_SHOW_BACKTRACE=1?
Have you set MV2_SUPPORT_DPM=1?
The error appears to be before MPI_Init so there could be some basic problem
with launching. However, after launching you will definitely need to set
MV2_SUPPORT_DPM=1 for your program to work correctly.
On Wed, Sep 03, 2014 at 05:56:13PM -0700, Adam T. Moody wrote:
> Hello MVAPICH team,
> I have someone interested in using the dynamic process management in MPI2,
> e.g, MPI_Comm_spawn. I see the user manual says this should work for the
> OFA-IB-CH3 interface:
>
> http://mvapich.cse.ohio-state.edu/static/media/mvapich/mvapich2-2.0-userguide.html#x1-350005.2.5
>
> I'm testing on a QLogic/PSM network but I've built using IB-CH3. I use
> mpirun_rsh with -rsh to run, and I get the following error:
>
> rzalastor34<53>: ./install/bin/mpirun_rsh -np 1 -hostfile hostfile -rsh
> ./spawn_test
> [rzalastor34:mpi_rank_0][rdma_find_network_type] QLogic IB card detected in
> system
> [rzalastor34:mpi_rank_0][rdma_find_network_type] Please re-configure the
> library with the '--with-device=ch3:psm' configure option for best
> performance
> [rzalastor34:mpirun_rsh][child_handler] Error in init phase, aborting! (1/2
> mpispawn connections)
>
> Is there a way to get more information?
> Thanks,
> -Adam
>
>
> I've configured MVAPICH2 with the following:
>
> rzalastor34<62>: ./install/bin/mpiname -a
> MVAPICH2 2.0 Fri Jun 20 20:00:00 EDT 2014 ch3:mrail
>
> Compilation
> CC: gcc -g -O0 -g -O0
> CXX: g++ -g -O0 -g
> F77: gfortran -L/lib -L/lib -g -O0 -fno-second-underscore -g -O0
> FC: gfortran -g -O0 -fno-second-underscore -g
>
> Configuration
> --prefix=/collab/usr/global/tools/mpi/sideinstalls/chaos_5_x86_64_ib/mvapich2-2.0-gnu-psm-debug-dpm/install
> --enable-f77 --enable-fc --enable-cxx --disable-fast --enable-g=dbg
> --enable-error-checking=runtime --enable-error-messages=all
> --enable-nmpi-as-mpi --enable-shared --enable-sharedlibs=gcc
> --enable-debuginfo --with-device=ch3:mrail --with-rdma=gen2 --enable-romio
> --with-file-system=lustre+nfs+ufs --disable-mpe --without-mpe
>
> The spawn_test.c file consists of this:
>
> #include <stdio.h>
> #include <stdlib.h>
>
> #include "mpi.h"
>
> int main(int argc, char* argv[])
> {
> MPI_Init(&argc, &argv);
>
> int errs[2];
> MPI_Comm comm;
> MPI_Comm_spawn("spawn_test2", MPI_ARGV_NULL, 1, MPI_INFO_NULL, 0,
> MPI_COMM_WORLD, &comm, errs);
> MPI_Barrier(comm);
>
> MPI_Finalize();
>
> return 0;
> }
>
> And the spawn_test2.c file has this:
>
> #include <stdio.h>
> #include <stdlib.h>
>
> #include "mpi.h"
>
> int main(int argc, char* argv[])
> {
> MPI_Init(&argc, &argv);
>
> int rank, ranks;
> MPI_Comm_rank(MPI_COMM_WORLD, &rank);
> MPI_Comm_size(MPI_COMM_WORLD, &ranks);
>
> printf("Rank %d Size %d\n", rank, ranks);
>
> MPI_Finalize();
>
> return 0;
> }
>
>
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mailman.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
--
Jonathan Perkins
More information about the mvapich-discuss
mailing list