[mvapich-discuss] mvapich2 -> rdma_setup_startup_ring(410): cannot create cq

Jim Galarowicz jeg at krellinst.org
Sun Nov 1 17:02:22 EST 2015


Hi everyone,

I'm running on a small cluster that has slurm and mvapich2 version 2.1 
installed.
However, I'm seeing this error when I try to run a simple mpi application.

    /srun -n 2 --mpi=pmi2 ./nbody-mvapich2//
    / /
    //In: PMI_Abort(1, Fatal error in MPI_Init://
    //Other MPI error, error stack://
    //MPIR_Init_thread(514).......: //
    //MPID_Init(367)..............: channel initialization failed//
    //MPIDI_CH3_Init(492).........: //
    //MPIDI_CH3I_RDMA_init(224)...: //
    //rdma_setup_startup_ring(410): cannot create cq//
    //)//
    //In: PMI_Abort(1, Fatal error in MPI_Init://
    //Other MPI error, error stack://
    //MPIR_Init_thread(514).......: //
    //MPID_Init(367)..............: channel initialization failed//
    //MPIDI_CH3_Init(492).........: //
    //MPIDI_CH3I_RDMA_init(224)...: //
    //rdma_setup_startup_ring(410): cannot create cq//
    //)//
    /



I searched the internet and found this url 
(http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/2012-September/004027.html) 
on the "cannot create cq" issue, which suggests we need to set

ulimit -l unlimited  in  /etc/sysconfig/slurm

> If it doesn't show unlimited (or some other number much higher than 64)
> then you'll need to do something to update the limits slurm is using.
> On redhat systems you can put the following in /etc/sysconfig/slurm.
>
>      ulimit -l unlimited
So, I added that file with the "ulimit -l unlimited" statement added.
But, it didn't seem to make any difference on the issue.

Does anyone have any hints on what might be wrong?

Thank you,
Jim G




-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20151101/271a6ac8/attachment.html>


More information about the mvapich-discuss mailing list