Bug 5348 - Suspending Nodes which are not in IDLE mode
Summary: Suspending Nodes which are not in IDLE mode
Status: RESOLVED INVALID
Alias: None
Product: Slurm
Classification: Unclassified
Component: slurmctld (show other bugs)
Version: 17.11.7
Hardware: Linux Linux
: --- 6 - No support contract
Assignee: Jacob Jenson
QA Contact:
URL:
Depends on:
Blocks:
 
Reported: 2018-06-25 03:32 MDT by Robert Klemm
Modified: 2020-05-15 07:28 MDT (History)
1 user (show)

See Also:
Site: -Other-
Alineos Sites: ---
Atos/Eviden Sites: ---
Confidential Site: ---
Coreweave sites: ---
Cray Sites: ---
DS9 clusters: ---
HPCnow Sites: ---
HPE Sites: ---
IBM Sites: ---
NOAA SIte: ---
OCF Sites: ---
Recursion Pharma Sites: ---
SFW Sites: ---
SNIC sites: ---
Linux Distro: ---
Machine Name:
CLE Version:
Version Fixed:
Target Release: ---
DevPrio: ---
Emory-Cloud Sites: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Robert Klemm 2018-06-25 03:32:00 MDT
Hey folks,

when using PowerSaving, nodes which state is not IDLE, but DOWN, DRAIN, FAIL or even in MAINT, get set into PowerSaving-Mode and will be suspended.
Comment 1 fzillner 2020-05-15 07:28:51 MDT
I'm having the same problem. OpenHPC slurm 18.08.8 on CentOS 7.7.

The only workaround I can think of right now is to double check in my suspend script if the node is really IDLE or not. But IMO slurm should only suspend the nodes that are idle or down, at least that's what's stated in the docs.