The management nodes serve Network Time Protocol (NTP) at stratum 10, except for ncn-m001, which serves at stratum 8 (or lower if an upstream NTP server is set). All management nodes peer with each other.
Until an upstream NTP server is configured, the time on the NCNs may not match the current time at the site, but they will stay in sync with each other.
If nodes are missing metadata for NTP, then the data must be generated using csi and the system's system_config.yaml file.
The csi tool is not available on ncn-m001 after the CSM install is completed. However, if the install recovery data is still available on ncn-m001 or ncn-m003, then the csi tool can be retrieved from the saved PIT ISO file. To do this, see the step used to obtain access to CSI in the Enable NCN Disk Wiping Safeguard procedure.
If the seed data from system_config.yaml is not available, then open a support ticket to help generate the NTP data.
The following steps are structured to be executed on one node at a time. However, step #3 will generate all relevant files for each node. If multiple nodes are missing NTP data in BSS, then apply this fix to each node.
Update system_config.yaml to have the correct NTP settings:
ntp-servers:
- ncn-m001
- example.upstream.ntp.server
ntp-timezone: UTC
Generate new configurations:
csi config init
Change directory to the newly created system/basecamp directory and execute the upgrade_ntp_timezone_metadata.sh script.
cd system/basecamp && /usr/share/doc/csm/upgrade/scripts/upgrade_ntp_timezone_metadata.sh
Find the relevant file for the node with missing metadata (such as upgrade-metadata-000000000000.json) based on the MAC address of the node.
Find the component name (xname) for the node that needs to be fixed:
Run this command on the node that needs to be fixed in order to determine its xname.
cat /etc/cray/xname
From ncn-m001, update BSS:
csi handoff bss-update-cloud-init --user-data="upgrade-metadata-000000000000.json" --limit=<xname>
Continue with the upgrade.
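The file lookup and BSS update steps above can be sketched as a small helper for the case where several nodes need the fix. This is a minimal sketch, not part of the documented procedure: the function names find_metadata_file and print_bss_update_cmd are hypothetical, and it assumes the generated file names contain the node's MAC address with the separators removed, as the example name upgrade-metadata-000000000000.json suggests. The second function only prints the csi command so that it can be reviewed before it is run.

```shell
# Sketch only. Assumption: generated files are named
# upgrade-metadata-<MAC without separators>.json, as the example name suggests.

# Print the path of the metadata file in directory $1 that matches MAC address $2.
# The comparison ignores case and the ':' or '-' separators in the MAC address.
find_metadata_file() {
    dir=$1
    mac=$(printf '%s' "$2" | tr -d ':-' | tr 'A-F' 'a-f')
    for f in "$dir"/upgrade-metadata-*.json; do
        [ -e "$f" ] || continue
        base=$(basename "$f" | tr 'A-F' 'a-f')
        if [ "$base" = "upgrade-metadata-${mac}.json" ]; then
            printf '%s\n' "$f"
            return 0
        fi
    done
    return 1
}

# Print (do not run) the BSS update command for metadata file $1 and xname $2.
print_bss_update_cmd() {
    printf 'csi handoff bss-update-cloud-init --user-data="%s" --limit=%s\n' "$1" "$2"
}
```

For example, with a hypothetical MAC address and xname, `print_bss_update_cmd "$(find_metadata_file . b8:59:9f:00:00:01)" x3000c0s3b0n0` prints the command to run from ncn-m001 once the file has been located in system/basecamp.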
Set a token as described in Identify Nodes and Update Metadata.
When the upgrade is completed, run this script on ncn-m001 in order to ensure the time is set correctly on all NCNs:
for i in $(grep -oP 'ncn-\w\d+' /etc/hosts | sort -u); do
    ssh "$i" "TOKEN=$TOKEN /srv/cray/scripts/common/chrony/csm_ntp.py"
done
Clock sync is performed in increments instead of all at once, so it may take some time for the clocks to sync.
Before executing any commands, give the nodes some time to update. Sync typically happens within a few seconds, but on occasion it can take 30 minutes or more. Periodically running chronyc tracking will show clock statistics and can be used to determine if the clocks are gradually syncing.
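One way to watch the gradual sync described above is to poll chronyc tracking and read the "System time" line, which reports the current offset from NTP time in seconds. The following is a rough sketch under stated assumptions: the function names are hypothetical, and the 0.1 second threshold, 60 attempts, and 30 second interval are illustrative values rather than documented limits (60 attempts at 30 seconds covers roughly the 30 minute worst case mentioned above).

```shell
# Sketch only: function names, threshold, and polling interval are illustrative.

# Read `chronyc tracking` output on stdin and print the "System time" offset in seconds.
tracking_offset() {
    awk -F': ' '/^System time/ { split($2, parts, " "); print parts[1] }'
}

# Succeed if offset $1 (seconds) is at or below threshold $2 (seconds).
offset_within() {
    awk -v off="$1" -v max="$2" 'BEGIN { exit !(off + 0 <= max + 0) }'
}

# Poll chronyc until the offset is within the threshold or the attempts run out.
# Arguments: max offset in seconds, number of attempts, seconds between attempts.
wait_for_sync() {
    max_offset=${1:-0.1}
    attempts=${2:-60}
    interval=${3:-30}
    i=0
    while [ "$i" -lt "$attempts" ]; do
        offset=$(chronyc tracking | tracking_offset)
        if [ -n "$offset" ] && offset_within "$offset" "$max_offset"; then
            echo "clock offset ${offset}s is within ${max_offset}s"
            return 0
        fi
        sleep "$interval"
        i=$((i + 1))
    done
    echo "clock offset did not drop below ${max_offset}s" >&2
    return 1
}
```

Running `wait_for_sync` on an NCN blocks until the offset settles or the attempts are exhausted; the two parsing helpers can also be used on saved chronyc tracking output.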
On each affected NCN run the following:
Set a token as described in Identify Nodes and Update Metadata.
Export the token.
export TOKEN
Run the script:
/srv/cray/scripts/common/chrony/csm_ntp.py