<head><!-- BaNnErBlUrFlE-HeAdEr-start -->
<style>
#pfptBannerjdw98db { all: revert !important; display: block !important;
visibility: visible !important; opacity: 1 !important;
background-color: #CFD3D7 !important;
max-width: none !important; max-height: none !important }
.pfptPrimaryButtonjdw98db:hover, .pfptPrimaryButtonjdw98db:focus {
background-color: #adb0b4 !important; }
.pfptPrimaryButtonjdw98db:active {
background-color: #8c8e91 !important; }
</style>
<!-- BaNnErBlUrFlE-HeAdEr-end -->
</head><!-- BaNnErBlUrFlE-BoDy-start -->
<!-- Preheader Text : BEGIN -->
<div style="display:none !important;display:none;visibility:hidden;mso-hide:all;font-size:1px;color:#ffffff;line-height:1px;height:0px;max-height:0px;opacity:0;overflow:hidden;">
Hi Akshay, Thank you so much for your help. I am testing new things with your library. If I can help you in any way, please let me know. I tried to start the spark cluster with the traditional script (. /sbin/start-all. sh) with the same result,
</div>
<!-- Preheader Text : END -->
<!-- Email Banner : BEGIN -->
<div style="display:none !important;display:none;visibility:hidden;mso-hide:all;font-size:1px;color:#ffffff;line-height:1px;height:0px;max-height:0px;opacity:0;overflow:hidden;">ZjQcmQRYFpfptBannerStart</div>
<!--[if ((ie)|(mso))]>
<table border="0" cellspacing="0" cellpadding="0" width="100%" style="padding: 16px 0px 16px 0px; direction: ltr" lang="en"><tr><td>
<table border="0" cellspacing="0" cellpadding="0" style="padding: 0px 10px 5px 6px; width: 100%; border-radius:4px; border-top:4px solid #8c8e91;background-color:#CFD3D7;"><tr><td valign="top">
<table align="left" border="0" cellspacing="0" cellpadding="0" style="padding: 4px 8px 4px 8px">
<tr><td style="color:#000000; font-family: 'Arial', sans-serif; font-weight:bold; font-size:14px; direction: ltr">
This Message Is From an External Sender
</td></tr>
<tr><td style="color:#000000; font-weight:normal; font-family: 'Arial', sans-serif; font-size:12px; direction: ltr">
This message came from outside your organization.
</td></tr>
</table>
<![if ie]><br clear="all"><![endif]>
<table align="right" border="0" cellspacing="0" cellpadding="0" style="padding: 4px 0px 4px 0px"><tr>
<td style="direction: ltr"> <a target="_blank" href="https://us-phishalarm-ewt.proofpoint.com/EWT/v1/KGKeukY!vYQd06pBw4oBSdX73OJkWxk97QaxYQXWciJPQDXl_d_Uhs_cNMn5Jltjwq8NHayNpiqmthJJRbFYmw1sc2wfSimD7NcOs6WTkkJgjFna39sQvLXNYU5ViK5Y4fC2JuinJl5Mhw$" style="mso-padding-alt: 7.5px; padding: 7.5px; border-radius: 2px; border: 1.5px solid #666666; "><strong style="font-weight: normal; color: #000000; text-decoration: none; font-family: 'Arial', sans-serif; font-size:14px; line-height: 40px; "> Report Suspicious </strong></a> </td>
</tr></table>
</td></tr></table>
</td></tr></table>
<![endif]-->
<![if !((ie)|(mso))]>
<div dir="ltr" lang="en" id="pfptBannerjdw98db" style="all: revert !important; display:block !important; text-align: left !important; margin:16px 0px 16px 0px !important; padding:8px 16px 8px 16px !important; border-radius: 4px !important; min-width: 200px !important; background-color: #CFD3D7 !important; background-color: #CFD3D7; border-top: 4px solid #8c8e91 !important; border-top: 4px solid #8c8e91;">
<div id="pfptBannerjdw98db" style="all: unset !important; float:left !important; display:block !important; margin: 0px 0px 1px 0px !important; max-width: 600px !important;">
<div id="pfptBannerjdw98db" style="all: unset !important; display:block !important; visibility: visible !important; background-color: #CFD3D7 !important; color:#000000 !important; color:#000000; font-family: 'Arial', sans-serif !important; font-family: 'Arial', sans-serif; font-weight:bold !important; font-weight:bold; font-size:14px !important; line-height:18px !important; line-height:18px">
This Message Is From an External Sender
</div>
<div id="pfptBannerjdw98db" style="all: unset !important; display:block !important; visibility: visible !important; background-color: #CFD3D7 !important; color:#000000 !important; color:#000000; font-weight:normal; font-family: 'Arial', sans-serif !important; font-family: 'Arial', sans-serif; font-size:12px !important; line-height:18px !important; line-height:18px; margin-top:2px !important;">
This message came from outside your organization.
</div>
</div>
<div id="pfptBannerjdw98db" style="all: unset !important; float: right !important; display: block !important; display: block; margin: 0px 0px 0px 16px !important; text-align: right !important; width: fit-content !important;">
<a id="pfptBannerjdw98db" href="https://us-phishalarm-ewt.proofpoint.com/EWT/v1/KGKeukY!vYQd06pBw4oBSdX73OJkWxk97QaxYQXWciJPQDXl_d_Uhs_cNMn5Jltjwq8NHayNpiqmthJJRbFYmw1sc2wfSimD7NcOs6WTkkJgjFna39sQvLXNYU5ViK5Y4fC2JuinJl5Mhw$"
style="all: unset !important; display: inline-block !important; text-decoration: none">
<div class="pfptPrimaryButtonjdw98db" style="display: inline-block !important; display: inline-block; visibility: visible !important; opacity: 1 !important; color: #000000 !important; color: #000000; font-family: 'Arial', sans-serif !important; font-family: 'Arial', sans-serif; font-size: 14px !important; font-weight: normal !important; text-decoration: none !important; border-radius: 2px !important; padding: 7.5px 16px !important; margin: 3px 0 3px 16px !important; white-space: nowrap !important; width: fit-content !important;
border: 1px solid #666666">
Report Suspicious
</div>
</a>
</div>
<div style="clear: both !important; display: block !important; visibility: hidden !important; line-height: 0 !important; font-size: 0.01px !important; height: 0px"> </div>
</div>
<![endif]>
<div style="display:none !important;display:none;visibility:hidden;mso-hide:all;font-size:1px;color:#ffffff;line-height:1px;height:0px;max-height:0px;opacity:0;overflow:hidden;">ZjQcmQRYFpfptBannerEnd</div>
<!-- Email Banner : END -->
<!-- BaNnErBlUrFlE-BoDy-end -->
<div dir="ltr">Hi Akshay,<div><br></div><div>Thank you so much for your help. I am testing new things with your library. If I can help you in any way, please let me know.</div><div><br></div><div>I tried to start the spark cluster with the traditional script (./sbin/start-all.sh) with the same result, for any reason no workers seem to be available.</div><div><br></div><div>On the other hand, I have downloaded the tarball you have available on the website, would you have a public repository where the source code is?<br></div><div><br></div><div>Thanks again for your help. Best regards.</div><div>Gabriel.</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">El mié, 12 jun 2024 a las 18:42, Paniraja Guptha, Akshay (<<a href="mailto:panirajaguptha.1@osu.edu">panirajaguptha.1@osu.edu</a>>) escribió:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="msg4112987599094189187">
<div lang="EN-US" style="overflow-wrap: break-word;">
<div class="m_4112987599094189187WordSection1">
<p class="MsoNormal"><span style="font-size:11pt">Hi Gabriel,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11pt">Thanks for contacting us. <u></u>
<u></u></span></p>
<p class="MsoNormal"><span style="font-size:11pt">We are taking a look at this. We will get back to you once we have an update.<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11pt"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11pt">-Akshay<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11pt"><u></u> <u></u></span></p>
<div style="border-right:none;border-bottom:none;border-left:none;border-top:1pt solid rgb(225,225,225);padding:3pt 0in 0in">
<p class="MsoNormal"><b><span style="font-size:11pt;font-family:Calibri,sans-serif">From:</span></b><span style="font-size:11pt;font-family:Calibri,sans-serif"> Mvapich-discuss <mvapich-discuss-bounces+panirajaguptha.1=<a href="mailto:osu.edu@lists.osu.edu" target="_blank">osu.edu@lists.osu.edu</a>>
<b>On Behalf Of </b>GABRIEL SOTODOSOS MORALES via Mvapich-discuss<br>
<b>Sent:</b> Tuesday, June 11, 2024 6:57 AM<br>
<b>To:</b> <a href="mailto:mvapich-discuss@lists.osu.edu" target="_blank">mvapich-discuss@lists.osu.edu</a><br>
<b>Subject:</b> [Mvapich-discuss] Problems trying to run SparkPi example with MPI4Spark<u></u><u></u></span></p>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<p class="MsoNormal"><span style="font-size:1pt;color:white">Hi Mvapich-discuss, I´m trying to run the SparkPi example in my cluster using the Standalone Cluster Manager. However, my executor gets stuck when deploying the
tasks to the executors with the following message: "WARN TaskSchedulerImpl:</span><span style="font-size:1pt;font-family:Arial,sans-serif;color:white"> </span><span style="font-size:1pt;color:white">
<u></u><u></u></span></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:1pt;color:white"><u></u><u></u></span></p>
</div>
<p class="MsoNormal">Hi Mvapich-discuss,<u></u><u></u></p>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I´m trying to run the SparkPi example in my cluster using the Standalone Cluster Manager. However, my executor gets stuck when deploying the tasks to the executors with the following message:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal"><i>"WARN TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources"</i><u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I have followed the steps in the user guide, I don´t know if I did something wrong or if I missed something. With the same configuration in Spark, I can run the SparkPi example without problems.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I am using MVAPICH-3.0 compiled as follows: <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">--prefix=/beegfs/home/javier.garciablas/gsotodos/bin_noref/mvapich/ --enable-threads=multiple --enable-romio --with-device=ch4:ofi:psm2 --with-libfabric=/opt/libfabric<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">And here are my configuration files:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">spark-env.sh:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export SPARK_HOME=$HOME/mpi4spark-0.2-x86-bin<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export SPARK_NO_DAEMONIZE=1<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export JAVA_LIBRARY_PATH=$JAVA_LIBRARY_PATH:$MV2J_HOME<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$MV2J_HOME/lib<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export SPARK_LIBRARY_PATH=$MV2J_HOME/lib<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export JAVA_BINARY=$JAVA_HOME/bin<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">export WORK_DIR=$SPARK_HOME/exec-wdir<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">spark-defaults.conf:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">spark.executor.extraJavaOptions -Djava.library.path=$HOME/mvapich2-j-2.3.7/lib<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">app.sh:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">./bin/spark-submit --master spark://$1:7077 --class org.apache.spark.examples.SparkPi examples/jars/spark-examples_2.12-3.3.0-SNAPSHOT.jar 1024<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">sbin/start-mpi4spark.sh:<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">HOSTFILE=hostfile<br>
procs=`wc -l < ${HOSTFILE}`<br>
javac -cp $MV2J_HOME/lib/mvapich2-j.jar SparkMPI.java<br>
host=`tail -2 ${HOSTFILE} | head -1`<br>
<br>
{<br>
$MPILIB/bin/mpirun_rsh -export-all -np $procs -hostfile ${HOSTFILE} SLURM_JOB_ID=$SLURM_JOB_ID MV2_RNDV_PROTOCOL=RGET MV2_USE_RDMA_FAST_PATH=0 MV2_USE_COALESCE=0 MV2_SUPPORT_DPM=1 MV2_HOMOGENEOUS_CLUSTER=1 MV2_ENABLE_AFFINITY=0 LD_PRELOAD= $MPILIB/lib/libmpi.so
java -cp $MV2J_HOME/lib/mvapich2-j.jar:. -Djava.library.path=$MV2J_HOME/lib SparkMPI $host<br>
} >& exec.log<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">After launching sbin/start-mpi4spark.sh the master and workers nodes keep alive but the execution gets stuck as said before. Am I missing something? Thanks for the help in advance.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Best regads.<u></u><u></u></p>
</div>
<p class="MsoNormal">Gabriel.<u></u><u></u></p>
</div>
</div>
</div></blockquote></div>