This procedure updates the firmware on NIC mezzanine cards (NMCs) in liquid-cooled or standard rack compute nodes that use Slingshot 10 Mellanox ConnectX-5 NICs. The RPM deployed on the compute nodes contains the scripts and firmware images required to perform the firmware and configuration updates.
Attention: The NIC firmware update is performed in-band, while the node is running the compute image. Use the CX-5 NIC firmware that is deployed with the compute node RPMs, not firmware from another repository.
See Update Firmware with FAS for information about automated firmware updates using Redfish.
Time required: 2-5 minutes for a firmware update and 1-3 minutes for a configuration update.
SSH to the node as root.
Load the module.
nid# module load cray-shasta-mlnx-firmware
nid# module show cray-shasta-mlnx-firmware
Example output:
-------------------------------------------------------------------
/opt/cray/modulefiles/cray-shasta-mlnx-firmware/1.0.5:
module-whatis "This module adds cray-shasta-mlnx-firmware v1.0.5 to the environment"
prepend-path PATH /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin
-------------------------------------------------------------------
List the contents of the script and firmware directories.
nid# ls /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin
Example output:
apply_mlnx_configs generate_mlnx_configs update_mlnx_firmware
nid# ls /opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/
Example output:
CRAY000000001/ MT_0000000011/ images/
nid# ls /opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/*
Example output:
/opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/CRAY000000001:
config.xml fw-ConnectX5-rel-16_26_4012-Cray_Timms_mezz_100G_1P-UEFI-14.19.17-FlexBoot-3.5.805.bin
/opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/MT_0000000011:
config.xml fw-ConnectX5-rel-16_26_4012-MCX515A-CCA_Ax-UEFI-14.19.17-FlexBoot-3.5.805.bin
/opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/images:
CRAY000000001.bin MT_0000000011.bin
Update the firmware on the node.
nid# update_mlnx_firmware
Apply the configuration settings.
nid# apply_mlnx_configs
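The two steps above can be chained so that the configuration is applied only when the firmware update succeeds. The following is a minimal sketch, not part of the shipped RPM: the function name update_and_configure is hypothetical, and it assumes that update_mlnx_firmware exits with a non-zero status on failure, which this procedure does not state.

```shell
#!/bin/bash
# Hypothetical wrapper (not shipped with the RPM): run the firmware update,
# then apply the configuration only if the update step succeeded.
# ASSUMPTION: both scripts return a non-zero exit status on failure.
update_and_configure() {
    # Command names default to the scripts that the module adds to PATH.
    # They can be overridden with arguments, for example for a dry run.
    local update_cmd="${1:-update_mlnx_firmware}"
    local config_cmd="${2:-apply_mlnx_configs}"

    if ! "$update_cmd"; then
        echo "firmware update failed; skipping configuration" >&2
        return 1
    fi
    "$config_cmd"
}
```

With the module loaded, calling update_and_configure with no arguments on the node runs both scripts in order.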
To update multiple nodes at once, first determine the path that the module prepends to PATH.
nid# module show cray-shasta-mlnx-firmware
Example output:
-------------------------------------------------------------------
/opt/cray/modulefiles/cray-shasta-mlnx-firmware/1.0.5:
module-whatis "This module adds cray-shasta-mlnx-firmware v1.0.5 to the environment"
prepend-path PATH /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin
-------------------------------------------------------------------
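The prepend path can also be pulled out of the module show text with a small filter rather than read by eye. This is a sketch under assumptions: the function name prepend_path_from_module_show is hypothetical, and it expects the output layout shown in the example above (a prepend-path line whose second field is PATH).

```shell
#!/bin/bash
# Hypothetical helper: read `module show` text on standard input and print
# the directory that the modulefile prepends to PATH.
# ASSUMPTION: the output has the layout shown in the example above.
prepend_path_from_module_show() {
    # Match the first line of the form "prepend-path PATH <directory>".
    awk '$1 == "prepend-path" && $2 == "PATH" { print $3; exit }'
}
```

On many installations module show writes to standard error, so the filter would typically be used as: module show cray-shasta-mlnx-firmware 2>&1 | prepend_path_from_module_show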
Log in to ncn-m001 and use pdsh to update the firmware.
ncn-m001# pdsh -w NODE_LIST /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin/update_mlnx_firmware
Apply the configuration settings.
ncn-m001# pdsh -w NODE_LIST /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin/apply_mlnx_configs
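When many nodes are updated at once, the per-node output can be hard to scan. The sketch below wraps the two pdsh commands above and pipes each through dshbak -c, the pdsh companion tool that groups nodes with identical output. The function name run_on_nodes is hypothetical, and the sketch assumes dshbak is installed alongside pdsh on ncn-m001.

```shell
#!/bin/bash
# Hypothetical wrapper: run the firmware update and then the configuration
# step on a set of nodes, collapsing identical per-node output.
# ASSUMPTION: dshbak is available alongside pdsh.
run_on_nodes() {
    local nodes="$1"   # pdsh node list, for example the NODE_LIST value
    local sbin="$2"    # prepend path reported by `module show`

    pdsh -w "$nodes" "$sbin/update_mlnx_firmware" | dshbak -c
    pdsh -w "$nodes" "$sbin/apply_mlnx_configs" | dshbak -c
}
```

For example: run_on_nodes NODE_LIST /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin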
Use the Boot Orchestration Service (BOS) to reboot all the affected nodes.
ncn-m001# cray bos session create --template-uuid SESSION_TEMPLATE --operation reboot