[mvapich-discuss] MVAPICH on large clusters - timeouts - any advice?

Greg Lindahl greg.lindahl at qlogic.com
Fri Feb 23 21:52:29 EST 2007


On Thu, Feb 22, 2007 at 06:30:51PM +0000, Jonathan Follows wrote:

> [chpcc022:14] Got completion with error, code=12, dest rank=78 at line 397 
> in file viacheck.c

My guess is that you've got some bad HCAs/cables/switches in your
cluster. Have you looked at the error counters?

-- greg



More information about the mvapich-discuss mailing list