Reminder: If any problems are encountered and the procedure or command output does not provide relevant guidance, see Relevant troubleshooting links for upgrade-related issues.
ncn-m002
Set the CSM_RELEASE
variable to the target CSM version of this upgrade.
ncn-m002# CSM_RELEASE=csm-1.2.0
Follow either the Direct download or Manual copy procedure.
tar
file that is accessible from ncn-m002
, then the Direct download procedure may be used.tar
file to ncn-m002
.Set the ENDPOINT
variable to the URL of the directory containing the CSM release tar
file.
In other words, the full URL to the CSM release tar
file must be ${ENDPOINT}${CSM_RELEASE}.tar.gz
NOTE This step is optional for Cray/HPE internal installs, if ncn-m002
can reach the internet.
ncn-m002# ENDPOINT=https://put.the/url/here/
Run the script.
NOTE For Cray/HPE internal installs, if ncn-m002
can reach the internet, then the --endpoint
argument may be omitted.
ncn-m002# /usr/share/doc/csm/upgrade/1.2/scripts/upgrade/prepare-assets.sh --csm-version ${CSM_RELEASE} --endpoint "${ENDPOINT}"
Skip the Manual copy
subsection and proceed to Perform upgrade.
Copy the CSM release tar
file to ncn-m002
.
Set the CSM_TAR_PATH
variable to the full path to the CSM tar
file on ncn-m002
.
The
prepare-assets.sh
script will delete the CSM tarball in order to free space on the node. If not wanting the tarball file to be deleted for other reasons, then copy the tarball file to a different location, and setCSM_TAR_PATH
to point to this new location.
ncn-m002# CSM_TAR_PATH=/path/to/${CSM_RELEASE}.tar.gz
Run the script.
ncn-m002# /usr/share/doc/csm/upgrade/1.2/scripts/upgrade/prepare-assets.sh --csm-version ${CSM_RELEASE} --tarball-file "${CSM_TAR_PATH}"
Apply a workaround for the boot order and to ensure that disk boot entries appear in the BIOS boot selection menu.
ncn-m002# /usr/share/doc/csm/scripts/workarounds/boot-order/run.sh
During this stage there will be a brief (approximately five minutes) window where pods with Persistent Volumes (PV
s) will not be able to migrate between nodes.
This is due to a redeployment of the Ceph csi
provisioners into namespaces, in order to accommodate the newer charts and a better upgrade strategy.
Set the SW_ADMIN_PASSWORD
environment variable.
Set it to the admin
user password for the switches. This is required for post-upgrade tests.
read -s
is used to prevent the password from being written to the screen or the shell history.
ncn-m002# read -s SW_ADMIN_PASSWORD
ncn-m002# export SW_ADMIN_PASSWORD
Perform the upgrade.
Run csm-upgrade.sh
to deploy upgraded CSM applications and services.
ncn-m002# /usr/share/doc/csm/upgrade/1.2/scripts/upgrade/csm-upgrade.sh
cray-shared-kafka
was updated properlyOccasionally the cray-shared-kafka-kafka
pods will be restarted before the
cray-shared-kafka-zookeeper
pods are ready. Check to make sure that all
cray-shared-kafka-kafka
and cray-shared-kafka-zookeeper
pods have a READY
status
of 1/1
. If any of them have a 2/2
, then run the kafka-restart.sh
script.
ncn-m002# kubectl get pods -n services -l app.kubernetes.io/instance=cray-shared-kafka
Expected output:
NAME READY STATUS RESTARTS AGE
cray-shared-kafka-entity-operator-7f9895897d-zjgkm 3/3 Running 0 12m
cray-shared-kafka-kafka-0 2/2 Running 0 10m
cray-shared-kafka-kafka-1 2/2 Running 0 10m
cray-shared-kafka-kafka-2 2/2 Running 0 10m
cray-shared-kafka-zookeeper-0 1/1 Running 0 8m
cray-shared-kafka-zookeeper-1 1/1 Running 0 8m
cray-shared-kafka-zookeeper-2 1/1 Running 0 8m
In this example, because some of the pods are 2/2
, the kafka-restart.sh
script must be run.
ncn-m002# /usr/share/doc/csm/upgrade/1.2/scripts/strimzi/kafka-restart.sh
Verify that the Keycloak users localize job has completed as expected.
This section can be skipped if user localization is not required.
After an upgrade, it is possible that all expected Keycloak users were not localized. See Verification procedure to confirm that Keycloak localization has completed as expected.
This stage is completed. Continue to Stage 4.