[mvapich-discuss] Help with polled desc error revisited

Craig Tierney Craig.Tierney at noaa.gov
Tue Jul 22 17:53:17 EDT 2008


Sorry to followup my own message.  I found the result of the
previous thread.  Clearing MV2_USE_RING_STARTUP solves the issue.

Craig




Craig Tierney wrote:
> Back in January 2008, there was a thread about some users getting
> the following error messages:
> 
> [9] Abort: Error code in polled desc!
>  at line 1229 in file rdma_iba_priv.c
> 
> I did not see a resolution to this problem.  Did anyone
> find a solution?
> 
> I am trying to run some applications that were compiled under
> the following setup:
> 
> Dual-core Woodcrest, RHAS 4.4, Intel 9.1, OFED 1.2.5.1, Mvapich2-1.0
> 
> I am now trying to run these applications under a new environment:
> 
> Quad-core Harpertown, Centos 5.1, Intel 9.1, OFED 1.3.1, Mvapich2-1.0
> 
> Our goal is to minimize the changes in the environment, while getting
> on a new OS base.
> 
> Some executables that ran properly on the Woodcrest system do not
> run on the Harpertown system.  Not all codes exhibit this problem.
> If the codes are run on 4 cores per host (vs. 8), they launch correctly.
> 
> I have tried tweaking the limit for stacksize (because we generally set
> them to unlimited), but this did not help.
> 
> I did find that using MV2_USE_SHAM=1 on the Harpertown image caused codes
> to fail when sending certain sized messages.  I wonder if there is
> another variable that could affect startup.
> 
> Thanks,
> Craig
> 
> 
> 


-- 
Craig Tierney (craig.tierney at noaa.gov)


More information about the mvapich-discuss mailing list