[mvapich-discuss] MVAPICH-0.9.8 lockup with OFED-1.1-rc3
Andrew Dobbie
adobbie at cims.carleton.ca
Wed Sep 13 11:45:26 EDT 2006
Hi Pavel,
It does appear that the problem was the hosts file. Thanks for your
quick solution, it is much appreciated. Would it be possible for future
versions to check for configuration mistakes such as these?
-Andrew
On Wed, 2006-09-13 at 18:10 +0300, Pavel Shamis (Pasha) wrote:
> > The hosts file is identical for all machines except for the 127.0.0.1
> > entry. Will this cause a problem?
> > 127.0.0.1 ND01 localhost.localdomain localhost
>
> I think the line is the exactly source of the problem.
> It should be:
> 127.0.0.1 localhost.localdomain localhost
>
> The hosts file should be:
> 127.0.0.1 localhost.localdomain localhost
> 192.168.1.80 FLSRVR
> 192.168.8.1 ND01
> 192.168.8.2 ND02
> 192.168.8.3 ND03
> 192.168.8.4 ND04
> 192.168.8.5 ND05
> 192.168.8.6 ND06
>
> Please copy the file to all nodes and I believe the mpi will run :)
>
More information about the mvapich-discuss
mailing list