Re: [slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread sudhagar s
Attaching my slurm.conf file. can you please help me to find the issue. On Tue, Apr 9, 2019 at 12:08 PM Ole Holm Nielsen wrote: > On 09-04-2019 08:33, sudhagar s wrote: > > Thanks Ole, > > > > when i give "scontrol show node" it list down the details. where i can > > see RealMemory=1 is this wil

Re: [slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread sudhagar s
i didnt place any additional GPU card. i run this z840 workstation with default GPU (p2000) which is used for display(VGA). This might be the reason for this error then? On Tue, Apr 9, 2019 at 12:01 PM Ole Holm Nielsen wrote: > On 09-04-2019 08:25, sudhagar s wrote: > > Thanks For the respons

Re: [slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread Ole Holm Nielsen
On 09-04-2019 08:33, sudhagar s wrote: Thanks Ole, when i give "scontrol show node" it list down the details. where i can see RealMemory=1 is this will be a problem? In your "scontrol show node" image I read RealMemory=1 (units of MB) and mem=1M. I think you configured slurm.conf incorrectl

Re: [slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread sudhagar s
Thanks Ole, when i give "scontrol show node" it list down the details. where i can see RealMemory=1 is this will be a problem? On Tue, Apr 9, 2019 at 11:53 AM Ole Holm Nielsen wrote: > On 09-04-2019 07:37, sudhagar s wrote: > > Hi, Iam newbee in slurm. trying to setup a cluster for ML trainin

Re: [slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread Ole Holm Nielsen
On 09-04-2019 08:25, sudhagar s wrote: Thanks For the response. here is my node  and partition information: Well, 1 MB of real memory in the node is not a lot :-) This reminds me of the very old days where PCs had 640 kB RAM... On Tue, Apr 9, 2019 at 11:53 AM Ole Holm Nielsen mailto:ole.h.

Re: [slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread Ole Holm Nielsen
On 09-04-2019 07:37, sudhagar s wrote: Hi, Iam newbee in slurm. trying to setup a cluster for ML training purpose. i created controle node and compute node. both are up and running. when i enter "srun -N 1 hostname" it says " srun error memory specification can not be satisfied" "unable to allo

[slurm-users] Having Issue in Slurm cluster setup

2019-04-08 Thread sudhagar s
Hi, Iam newbee in slurm. trying to setup a cluster for ML training purpose. i created controle node and compute node. both are up and running. when i enter "srun -N 1 hostname" it says " srun error memory specification can not be satisfied" "unable to allocate resources: requested node configurati

Re: [slurm-users] slurmdbd purge not working

2019-04-08 Thread Lech Nieroda
Hello Julien, the innodb engine may stop working if you change parameters such as innodb_log_file_size without rebuilding the database, as the expected values no longer correspond to the encountered ones. Try using the old parameters. In order to debug the archive dump error you might want to ru

Re: [slurm-users] slurmdbd purge not working

2019-04-08 Thread Julien Rey
Hi Ole, Thank you for your advice. As I said in my previous messages, this is how I set the my.cnf: innodb_buffer_pool_size = 32G innodb_log_file_size= 64M innodb_lock_wait_timeout= 3600 I have read the thread "Extreme long db upgrade 16.05.6 -> 17.11.3". However I