Use the Hardware State Manager (HSM) Cray CLI commands to disable nodes on the system.
Disabling nodes that are not configured correctly allows the system to successfully boot.
(ncn-mw#
) Disable one or more nodes with HSM.
cray hsm state components bulkEnabled update --enabled false --component-ids XNAME_LIST
(ncn-mw#
) Verify the desired nodes are disabled.
cray hsm state components query create --component-ids XNAME_LIST --format toml
Example output:
[[Components]]
Type = "Node"
Enabled = false
State = "On"
NID = 1003
Flag = "OK"
Role = "Compute"
NetType = "Sling"
Arch = "X86"
ID = "x5000c1s0b1n1"
[[Components]]
Type = "Node"
Enabled = false
State = "On"
NID = 1004
Flag = "OK"
Role = "Compute"
NetType = "Sling"
Arch = "X86"
ID = "x5000c1s0b1n2"
After changing the state of nodes, be cautious when powering them on/off. The preferred method for safely powering them on/off is via the Boot Orchestration Service (BOS). The Cray Advanced Platform Monitoring and Control (CAPMC) service is used to directly control the power for nodes, regardless of the state in HSM. CAPMC does not check if a node is disabled in HSM.