This procedure updates the firmware on NIC mezzanine cards (NMCs) in liquid-cooled or standard rack compute nodes that use Slingshot 10 Mellanox ConnectX-5 NICs. The RPM deployed on the compute nodes contains the scripts and firmware images required to perform the firmware and configuration updates.
Attention: The NIC firmware update is performed in-band, while the node is running the compute image. Use the CX-5 NIC firmware that is deployed with the compute node RPMs, not firmware from any other repository.
See Update Firmware with FAS for information about automated firmware updates using Redfish.
Allow 2-5 minutes for a firmware update and 1-3 minutes for a configuration update.
SSH to the node as root.
(nid#) Load the module.
module load cray-shasta-mlnx-firmware
module show cray-shasta-mlnx-firmware
Example output:
-------------------------------------------------------------------
/opt/cray/modulefiles/cray-shasta-mlnx-firmware/1.0.5:
module-whatis "This module adds cray-shasta-mlnx-firmware v1.0.5 to the environment"
prepend-path PATH /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin
-------------------------------------------------------------------
(nid#) List the contents of the firmware directories.
ls /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin
Example output:
apply_mlnx_configs generate_mlnx_configs update_mlnx_firmware
ls /opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/
Example output:
CRAY000000001/ MT_0000000011/ images/
ls /opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/*
Example output:
/opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/CRAY000000001:
config.xml fw-ConnectX5-rel-16_26_4012-Cray_Timms_mezz_100G_1P-UEFI-14.19.17-FlexBoot-3.5.805.bin
/opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/MT_0000000011:
config.xml fw-ConnectX5-rel-16_26_4012-MCX515A-CCA_Ax-UEFI-14.19.17-FlexBoot-3.5.805.bin
/opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware/images:
CRAY000000001.bin MT_0000000011.bin
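The listing above suggests that each PSID directory (for example CRAY000000001) has a matching file in images/. The following is a minimal sketch of a check for that pairing; the function name check_fw_images is hypothetical, and the one-image-per-PSID layout is an assumption drawn only from the example output.

```shell
# Hypothetical helper: for each PSID directory under the firmware root,
# report whether images/<PSID>.bin is present.
# Assumption: one image per PSID directory, as in the example listing.
check_fw_images() {
    fw_root=$1
    status=0
    for dir in "$fw_root"/*/; do
        [ -d "$dir" ] || continue            # glob matched nothing
        psid=$(basename "$dir")
        [ "$psid" = "images" ] && continue   # skip the images directory itself
        if [ -e "$fw_root/images/$psid.bin" ]; then
            echo "$psid: ok"
        else
            echo "$psid: missing images/$psid.bin"
            status=1
        fi
    done
    return "$status"
}

# Usage on a node:
#   check_fw_images /opt/cray/cray-shasta-mlnx-firmware/1.0.5/share/firmware
```

The function returns non-zero if any PSID directory lacks an image, so it can gate a later step in a script.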
(nid#) Update the firmware on the node.
update_mlnx_firmware
(nid#) Apply the configuration settings.
apply_mlnx_configs
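The two commands above run in a fixed order: firmware first, then configuration. A minimal sketch of a wrapper that preserves that order is shown here. The function name update_then_configure is hypothetical, and the sketch assumes that update_mlnx_firmware exits non-zero on failure, which this procedure does not state.

```shell
# Hypothetical wrapper: apply the configuration only if the firmware
# update reported success.
# Assumption: update_mlnx_firmware returns a non-zero status on failure.
update_then_configure() {
    if ! update_mlnx_firmware; then
        echo "firmware update failed; skipping configuration" >&2
        return 1
    fi
    apply_mlnx_configs
}

# Usage on a node, after loading the module:
#   update_then_configure
```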
(nid#) Determine the directory that the module prepends to PATH.
module show cray-shasta-mlnx-firmware
Example output:
-------------------------------------------------------------------
/opt/cray/modulefiles/cray-shasta-mlnx-firmware/1.0.5:
module-whatis "This module adds cray-shasta-mlnx-firmware v1.0.5 to the environment"
prepend-path PATH /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin
-------------------------------------------------------------------
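The directory on the prepend-path line is the one used in the pdsh commands that follow. As a minimal sketch, it can be extracted from the module show text rather than copied by hand; the function name mlnx_sbin_dir is hypothetical, and the output format is assumed to match the example above.

```shell
# Hypothetical helper: read `module show` text on stdin and print the
# directory named on the first "prepend-path PATH" line.
mlnx_sbin_dir() {
    awk '$1 == "prepend-path" && $2 == "PATH" { print $3; exit }'
}

# Usage on a node (module show typically writes to stderr, hence 2>&1):
#   module show cray-shasta-mlnx-firmware 2>&1 | mlnx_sbin_dir
```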
(ncn-m001#) Log in to ncn-m001 and use pdsh to update the firmware.
pdsh -w NODE_LIST /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin/update_mlnx_firmware
(ncn-m001#) Apply the configuration settings.
pdsh -w NODE_LIST /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin/apply_mlnx_configs
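NODE_LIST in the pdsh commands above is a placeholder for the target hosts; pdsh -w accepts a comma-separated list. A minimal sketch of building that list from a file with one hostname per line follows. The function name node_list_from_file and the file name nodes.txt are hypothetical.

```shell
# Hypothetical helper: turn a file with one hostname per line into a
# comma-separated list. Blank lines and lines starting with # are skipped.
node_list_from_file() {
    grep -v -e '^[[:space:]]*$' -e '^#' "$1" | paste -sd, -
}

# Usage on ncn-m001 (nodes.txt is a hypothetical file name):
#   pdsh -w "$(node_list_from_file nodes.txt)" \
#       /opt/cray/cray-shasta-mlnx-firmware/1.0.5/sbin/update_mlnx_firmware
```

Keeping the node list in a file makes it easier to run the firmware update and the configuration step against exactly the same set of nodes.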
(ncn-m001#) Use the Boot Orchestration Service (BOS) to reboot all the affected nodes.
cray bos v1 session create --template-name SESSION_TEMPLATE --operation reboot