On 17/04/2019 18.54, Yang Liu wrote:
> We often receive errors due to socket timeouts on send/recv operations:
>
> slurm_load_jobs error: Socket timed out on send/recv operation
> slurm_load_node: Socket timed out on send/recv operation
>
>
> What could cause the errors? How likely job_submit.lu
We often receive errors due to socket timeouts on send/recv operations:
slurm_load_jobs error: Socket timed out on send/recv operation
slurm_load_node: Socket timed out on send/recv operation
What could cause the errors? How likely is it that job_submit.lua could cause such errors?
We have a program runni
On Friday, 19 October 2018 4:58:37 AM AEDT Kirk Main wrote:
> I'm a new administrator to Slurm and I've just got my new cluster up and
> running. We started getting a lot of "Socket timed out on send/recv
> operation" errors when submitting jobs, and also if you try to "squeue"
> while others are
Kirk,
MailProg=/usr/bin/sendmail
MailProg should be the program used to send mail, i.e. /bin/mail, not
sendmail.
If I am not wrong, in the jargon MailProg is an MUA, not an MTA (sendmail is
an MTA).
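As a minimal sketch of the change suggested above, the slurm.conf line might look like the following. The /bin/mail path comes from the reply itself; whether a mail client is actually installed at that path on the host running slurmctld is an assumption to verify locally.

```
# slurm.conf (fragment)
# Assumption: a mail(1)-style client (an MUA) exists at /bin/mail on the controller host.
MailProg=/bin/mail
```

After editing slurm.conf, the controller needs to re-read its configuration, for example with "scontrol reconfigure". Nothing in this thread establishes that this setting is related to the socket timeouts.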
On Thu, 18 Oct 2018 at 19:01, Kirk Main wrote:
> Hi all,
>
> I'm a new administrator to Slurm a
Hi all,
I'm a new administrator to Slurm and I've just got my new cluster up and
running. We started getting a lot of "Socket timed out on send/recv
operation" errors when submitting jobs, and also if you try to "squeue"
while others are submitting jobs. The job does eventually run after about a
m