[mvapich-discuss] MPD related error

Dhabaleswar Panda panda at cse.ohio-state.edu
Sat Jul 5 08:18:06 EDT 2008


Thanks for your note. You might have noticed that a new version of
MVAPICH2 (1.2RC1) was released a few days back. This release has a non-MPD
(daemon-less) startup scheme. This new start-up scheme is applicable for
all interfaces including uDAPL. I will suggest you to upgrade your
software stack to this release. It will provide you faster start-up and
you need not worry about the MPD-related issues.

DK


On Sat, 5 Jul 2008, yogeshwar sonawane wrote:

> Hi all,
>
> I am trying to run 64 processes using MVAPICH2-1.0.1-uDAPL on 8 nodes.
> Every node has 8 cores/cpus.
>
> Out of 64, sometimes one or more processes gets killed or closed. The
> node on which there are less than 8 processes running has following
> message which comes in /var/log/messages file :-
>
> Jul  4 13:23:05 pn02 mpdman: pn02_mpdman_12: mpd_uncaught_except_tb
> handling:   exceptions.AttributeError: 'int'
> object has no attribute 'send_dict_msg'
> /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpdman.py  652
> handle_lhs_input         self.ring.rhsSock.send_dict_msg(msg)
> /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpdlib.py  743
> handle_active_streams         handler(stream,*args)
> /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpdman.py  481  run
> rv = self.streamHandler.handle_active_streams(timeout=5.0)
> /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpd.py  1408
> launch_mpdman_via_fork         mpdman.run()
> /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpd.py  1325
> run_one_cli         (manPid,toManSock) =
> self.launch_mpdman_via_fork(msg,man_env)
>    /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpd.py  1199
> do_mpdrun         self.run_one_cli(lorank,msg)
>  /home/htdg/pn_mpi/mpi-bin_send-recv_pnet3/bin/mpd.py  854
> handle_lhs_input         self.do_mpdrun(msg)     /home/htdg
>
> Can anybody give me some more info about this ?
> Is this some kind of setup/settings issue on nodes ?
>
> Thanks,
> Yogeshwar
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>



More information about the mvapich-discuss mailing list