[mvapich-discuss] Problem with fabric combining DDR and SDR cards

Craig Tierney Craig.Tierney at noaa.gov
Wed Apr 30 12:37:36 EDT 2008


Matthew Koop wrote:
> Craig,
> 
> So are you running with MVAPICH2? Currently MVAPICH2 will require an
> additional environment variable when using cards of different types:
> 
> MV2_DEFAULT_MTU=IBV_MTU_1024
> 
> We will be adding support for cards of different speeds and types.
> MVAPICH 1.0 already has this support.
> 
> Let us know if this does not help,
> 


I am running MVAPICH2.  Specifying any IBV_MTU_* setting (256,512,1024,2048)
solves the problem for a small program (HPL).
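
For anyone scripting this workaround, here is a minimal Python sketch of
setting the variable before a launch. Only the name MV2_DEFAULT_MTU and the
four IBV_MTU_* values come from this thread. The helper names mtu_setting and
launch_env are hypothetical, and the sketch assumes the job launcher forwards
the caller's environment to every rank.

```python
import os

# MTU sizes reported in this thread as working with MV2_DEFAULT_MTU.
TESTED_MTUS = (256, 512, 1024, 2048)


def mtu_setting(size):
    """Return the IBV_MTU_* name for a byte size (hypothetical helper)."""
    if size not in TESTED_MTUS:
        raise ValueError(f"MTU size not covered in this thread: {size}")
    return f"IBV_MTU_{size}"


def launch_env(size, base=None):
    """Copy an environment and add MV2_DEFAULT_MTU to it.

    Assumption: the launcher passes this environment on to all ranks.
    """
    env = dict(os.environ if base is None else base)
    env["MV2_DEFAULT_MTU"] = mtu_setting(size)
    return env
```

The returned dictionary can be handed to subprocess.run(..., env=launch_env(1024))
together with whatever launch command the site already uses. Rejecting
unlisted sizes is a design choice of this sketch, not a statement about what
MVAPICH2 itself accepts.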

Why is this setting needed?  Are there any performance issues with
setting this value?  Why not just use the IBV_MTU_2048 value?

Thanks,
Craig



> Matt
> 
> On Fri, 25 Apr 2008, Craig Tierney wrote:
> 
>> I have an SDR-based fabric running OFED-1.2.5.1 and MVAPICH (both
>> 1.0 and 1.0.2p1).  My vendor sent a DDR card as a replacement
>> for a failed SDR and said 'it should just work'.  I tried to use
>> it, but I am not able to run jobs.  I get the following error
>> as codes start up:
>>
>> send desc error
>> [0] Abort: [] Got completion with error 9, vendor code=8a, dest rank=2
>>   at line 513 in file ibv_channel_manager.c
>> rank 0 in job 1  w347_44628   caused collective abort of all ranks
>>    exit status of rank 0: killed by signal 9
>>
>> The codes are able to start (for instance, HPL is able to print its headers).
>> This problem happens using both 1.0 and 1.0.2p1.  It does not happen
>> with OpenMPI-1.2.4.
>>
>> Should I be able to combine DDR and SDR cards in the same fabric and
>> run jobs across them?  Are there any performance issues with this
>> (I do not expect DDR speeds, but could jobs run slower than on SDR)?
>>
>> Thanks,
>> Craig
>> --
>> Craig Tierney (craig.tierney at noaa.gov)
>> _______________________________________________
>> mvapich-discuss mailing list
>> mvapich-discuss at cse.ohio-state.edu
>> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>>
> 


-- 
Craig Tierney (craig.tierney at noaa.gov)

