[OOD-users] Jupyter App -- satisfy password request && database connection error

E.M. Dragowsky dragowsky at case.edu
Fri Sep 28 17:54:50 EDT 2018


OK -- part 2 of my message (the database connection error) is explained by
the slurmdbd service stopping. Our Slurm hit the problem described in this
earlier SchedMD bug report:
https://bugs.schedmd.com/show_bug.cgi?id=2066

In effect, 37GB of "packmem", "pack32", and "pack16" errors were written
starting at 14:51 today, and they brought down the service by 15:30.

Monday I'll resume with understanding how to manage the password for the
Jupyter app -- and keep a careful eye on the interaction with slurmdbd....
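
For keeping an eye on slurmdbd, here is a minimal sketch of a reachability
check. It assumes the host and port shown in the sbatch errors quoted below
(smaster:6819); the function name is hypothetical. It only tests whether the
TCP port accepts connections, which is the condition behind "Connection
refused", and says nothing about whether the daemon is otherwise healthy.

```python
import socket


def slurmdbd_reachable(host="smaster", port=6819, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened.

    host and port default to the values in the sbatch error output;
    adjust them for your own site.
    """
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError,
        # a timeout, or a name-resolution failure) when it cannot connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling slurmdbd_reachable() from a cron job or a monitoring probe and
alerting on False would presumably have flagged the outage close to 15:30.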

On Fri, Sep 28, 2018 at 4:49 PM, Shawn Doughty <shawn.doughty at tufts.edu>
wrote:

> Hi,
>
> You mean jupyter notebook? I used the regular batchconnect version from
> OOD git and then just tweaked a couple of settings using native slurm
> connection. This shouldn't have to go over TurboVNC?   Happy to send you a
> copy of our setup here.  Users are really liking jupyter, the only thing I
> noticed is it has a tendency to barf when running out of memory which is
> why we give users the choice to pick cores and memory.
>
>> On Fri, Sep 28, 2018 at 4:25 PM E.M. Dragowsky <dragowsky at case.edu>
>> wrote:
>>
>>> Hi, everyone --
>>>
>>> My Jupyter development app is now configured and functional, such that
>>> the app launches and connects to the compute node I have provisioned with
>>> TurboVNC and websockify (still testing).
>>>
>>> It is now after 4pm local time, and I have two problems that are
>>> preventing the actual connection.
>>> --  After launching a session, a "Connect to Jupyter" button is
>>> presented.  After clicking through, I don't know how to satisfy the
>>> password request in the browser window. Post connection documentation
>>> doesn't make enough sense to me.
>>>
>>> --  Perhaps related to my failed password attempts, I'm now also
>>> experiencing a database connection error preventing any further evaluation:
>>>
>>>
>>> sbatch: error: slurm_persist_conn_open_without_init: failed to open persistant connection to smaster:6819: Connection refused
>>> sbatch: error: slurmdbd: Sending PersistInit msg: Connection refused
>>> sbatch: error: slurm_persist_conn_open_without_init: failed to open persistant connection to smaster:6819: Connection refused
>>> sbatch: error: slurmdbd: Sending PersistInit msg: Connection refused
>>> sbatch: error: slurmdbd: DBD_GET_CLUSTERS failure: Connection refused
>>> sbatch: error: Problem talking to database
>>> sbatch: error: There is a problem talking to the database: Connection refused.  Only local cluster communication is available, remove --cluster from your command line or contact your admin to resolve the problem.
>>>
>>>
>>> smaster is the name of my cluster, and we're using slurm.
>>>
>>> Thanks in advance
>>> ~ Emily
>>>
>>> --
>>> E.M. Dragowsky, Ph.D.
>>> Research Computing -- UTech
>>> Case Western Reserve University
>>> (216) 368-0082
>>>
>>
>
> --
> Shawn G. Doughty
> Manager - Research Computing
> Tufts Technology Services (TTS)
> Research Technology (RT)
> 16 Dearborn Road
> Somerville, MA 02144
> 617-627-5462
> http://it.tufts.edu/
>
> _______________________________________________
> OOD-users mailing list
> OOD-users at lists.osc.edu
> https://lists.osu.edu/mailman/listinfo/ood-users
>
>


-- 
E.M. Dragowsky, Ph.D.
Research Computing -- UTech
Case Western Reserve University
(216) 368-0082