[mvapich-discuss] New errors (was can't read MPIRUN_PROCESSES)

Aquarijen aquarijen at gmail.com
Thu Mar 15 12:51:45 EDT 2007


Hi Matt,

Wow, yes, that certainly changes things, and I now get different
errors.  In my error file:
-----------------------------------------
[0:b09n010.xxx.gov] Abort: [2:b09n008.xxx.gov] Abort:
[1:b09n009.xxx.gov] Abort: [3:b09n007.xxx.gov] Abort:
[b09n010.xxx.gov:0] Got completion with error IBV_WC_LOC_LEN_ERR, code=1
 at line 2381 in file viacheck.c
[b09n007.xxx.gov:3] Got completion with error IBV_WC_LOC_LEN_ERR, code=1
 at line 2381 in file viacheck.c
[3:b09n007.xxx.gov] Abort: [3] Got FATAL event IBV_EVENT_QP_LAST_WQE_REACHED, code=16
 at line 2561 in file viacheck.c
[1:b09n009.xxx.gov] Abort: [1] Got FATAL event IBV_EVENT_QP_LAST_WQE_REACHED, code=16
 at line 2561 in file viacheck.c
[b09n009.xxx.gov:1] Got completion with error IBV_WC_LOC_LEN_ERR, code=1
 at line 2381 in file viacheck.c
[2:b09n008.xxx.gov] Abort: [2] Got FATAL event IBV_EVENT_QP_LAST_WQE_REACHED, code=16
 at line 2561 in file viacheck.c
[b09n008.xxx.gov:2] Got completion with error IBV_WC_LOC_LEN_ERR, code=1
 at line 2381 in file viacheck.c
done.
------------------------------------------------------------
Have you seen this before?  It now looks similar to the problem I had
with 0.9.8, which I was never able to figure out. :(

Thanks for all of your help!!!
-Jennifer


On 3/14/07, Matthew Koop <koop at cse.ohio-state.edu> wrote:
>
> > Maybe I should triple check from now on...
> > I was using the "mpirun" from 0.9.9, but not the "mpirun_rsh" from
> > 0.9.9.  Changing this got rid of the error message.  Now I get:
> > OSU MVAPICH VERSION 0.9.9-SingleRail
> > Build-ID: custom
> > in the error file - and this seems better!
> > But now I get no output from the cpi or the hello++ program. :(  This
> > is what I get:
>
> > echo "Node file: $PBS_NODEFILE :"
>
> Jen,
>
> I think if you remove the "-v" from your mpirun_rsh line, things should
> work for you. Also, you may need to be a bit careful with the
> $PBS_NODEFILE if you run more processes than nodes. With your example it
> should be fine, however.
>
> Let me know if this works.
>
> Matt
>
>
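
For reference, Matt's caution about $PBS_NODEFILE can be checked with a
small shell sketch. It assumes the node file holds one hostname per line,
repeated once per requested slot, and that mpirun_rsh is given "-np" and
"-hostfile" (these flags, the helper name count_unique_nodes, and the
./cpi path are assumptions, not taken from the job script in this thread):

```shell
#!/bin/sh
# Hypothetical helper: count the distinct hostnames in a node file.
count_unique_nodes() {
    sort -u "$1" | wc -l | tr -d ' '
}

# Fall back to an empty input when not running under PBS.
nodefile="${PBS_NODEFILE:-/dev/null}"

# One line per process slot; distinct hostnames give the node count.
nprocs=$(wc -l < "$nodefile" | tr -d ' ')
nnodes=$(count_unique_nodes "$nodefile")
echo "processes: $nprocs, distinct nodes: $nnodes"

# Assumed launch line, without "-v" (adjust to your site):
# mpirun_rsh -np "$nprocs" -hostfile "$nodefile" ./cpi
```

If the two counts differ, more than one process will land on at least one
node, which is the case Matt suggests being careful with.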


-- 
When I play with my cat, who knows whether she is not amusing herself
with me more than I with her.
Michel de Montaigne

