[mvapich-discuss] Hang in CH3 SMP Rendezvous protocol w/ CUDA w/o Infiniband

Filippo Spiga spiga.filippo at gmail.com
Mon Feb 2 00:55:22 EST 2015


On Jan 22, 2015, at 2:46 PM, Paul Sathre <sath6220 at cs.vt.edu> wrote:
> The hanging system has 4x Tesla C2070s running Nvidia driver 319.32 and libibverbs 1.1.6 (I have tested swapping in libibverbs 1.1.8 and gcc 4.8 to make it more like the successful systems, to no avail. Vimdiff examination of the config.log of the failing system vs. either succeeding system shows no significant changes.)

I'm not 100% sure, but as far as I understand you need Kepler cards to use GPUDirect RDMA, and the Tesla C2070 is a Fermi card.
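
A rough way to check the Kepler requirement is to look at each card's compute capability: Fermi parts such as the Tesla C2070 report 2.x, while Kepler parts report 3.x. The sketch below only encodes that comparison. The helper names (`is_kepler_or_newer`, `describe`) are made up for illustration, and the (major, minor) values are typed in by hand; on a real system they would come from something like `cudaGetDeviceProperties`. It says nothing about whether this is actually the cause of the hang.

```python
# Hypothetical helper names; only the compute-capability numbers are NVIDIA facts.
KEPLER_MAJOR = 3  # Kepler GPUs report compute capability 3.x

# Architecture names for the compute-capability majors relevant to this thread.
ARCH_BY_MAJOR = {2: "Fermi", 3: "Kepler"}


def is_kepler_or_newer(major: int) -> bool:
    """Return True if the compute-capability major version is 3 (Kepler) or higher."""
    return major >= KEPLER_MAJOR


def describe(name: str, major: int, minor: int) -> str:
    """Summarise one GPU; the (major, minor) pair is supplied by the caller."""
    arch = ARCH_BY_MAJOR.get(major, "other")
    verdict = "Kepler or newer" if is_kepler_or_newer(major) else "pre-Kepler"
    return f"{name}: compute capability {major}.{minor} ({arch}), {verdict}"


if __name__ == "__main__":
    # Assumption: values entered manually. The Tesla C2070 is compute capability 2.0.
    print(describe("Tesla C2070", 2, 0))
```

If a card turns out to be pre-Kepler, it would be worth confirming that the build is not trying to take a GPUDirect RDMA path at all.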

F

--
Mr. Filippo SPIGA, M.Sc.
http://filippospiga.info ~ skype: filippo.spiga

«Nobody will drive us out of Cantor's paradise.» ~ David Hilbert





