The ability to ignore non-compute nodes (NCNs) is turned off by default. Management nodes and NCNs are also not locked by
default. The administrator must lock the NCNs and their BMCs to prevent unwanted actions from affecting these nodes. To more
easily identify the BMCs that are associated with the management nodes, they need to be marked with the Management role in
the Hardware State Manager (HSM), just like their associated nodes.
This section only covers marking BMCs of management nodes with the Management role using HSM.
For more information on locking or ignoring nodes, refer to the following sections:
The Cray Command Line Interface must be configured on the NCN where this procedure is being performed. See Configure the Cray CLI.
The BMCs of NCNs should be marked with the Management role as early as possible in the install/upgrade cycle to prevent unintentionally taking down a critical node.
The Management role on the BMCs cannot be set until after Kubernetes is running and the HSM service is operational.
(ncn-mw#) Check whether HSM is running with the following command:
kubectl -n services get pods | grep smd
Example output:
cray-smd-848bcc875c-6wqsh 2/2 Running 0 9d
cray-smd-848bcc875c-hznqj 2/2 Running 0 9d
cray-smd-848bcc875c-tp6gf 2/2 Running 0 6d22h
cray-smd-init-2tnnq 0/2 Completed 0 9d
cray-smd-postgres-0 2/2 Running 0 19d
cray-smd-postgres-1 2/2 Running 0 6d21h
cray-smd-postgres-2 2/2 Running 0 19d
cray-smd-wait-for-postgres-4-7c78j 0/3 Completed 0 9d
The cray-smd pods need to be in the Running state, except for cray-smd-init and
cray-smd-wait-for-postgres which should be in Completed state.
Use the cray hsm state components bulkRole update command to perform setting roles on the BMC.
(ncn-mw#) Get the list of BMCs of management nodes.
BMCList=$(cray hsm state components list --role Management --type Node --format json | jq -r .Components[].ID | \
sed 's/n[0-9]*//' | tr '\n' ',' | sed 's/.$//')
echo ${BMCList}
Example output:
x3000c0s1b0,x3000c0s2b0,x3000c0s3b0,x3000c0s6b0,x3000c0s5b0,x3000c0s4b0,x3000c0s7b0,x3000c0s8b0,x3000c0s9b0
(ncn-mw#) Set the Management role for those BMCs.
cray hsm state components bulkRole update --role Management --component-ids "${BMCList}"
This command gives no output when it completes successfully.
(ncn-mw#) Set the Management role for specific BMCs.
cray hsm state components bulkRole update --role Management --component-ids x3000c0s8b0
This command gives no output when it completes successfully.