This document provides background information about the NCNs (non-compute nodes) which function as management nodes for the HPE Cray EX system. This information is not normally needed to install software, but provides background which may be helpful for troubleshooting an installation.
cloud-init
Basecamp configurationThe Cray Site Init (csi
) command has several files which describe pre-configuration data needed during
the installation process:
application_node_config.yaml
cabinets.yaml
hmn_connections.json
ncn_metadata.csv
switch_metadata.csv
In addition, after being run with those files, csi
creates an output system_config.yaml
file which can be used as an input to csi
when reinstalling this software release.
See Cray Site Init Files for more information about these files.
While a system is being installed for the first time, a certificate authority (CA) is needed. This can be generated for a system, or one can be supplied from a customer intermediate CA. Outside of a new installation, there is no supported method to rotate or change the platform CA in this release.
For more information about these topics, see Certificate Authority.
The management nodes boot from NCN images which are created as layers on top of a common base image. The common image is customized with a Kubernetes layer for the master nodes and worker nodes. The common image is also customized with a storage/Ceph layer for the utility storage nodes. Three artifacts are needed to boot the management nodes.
For more information, see NCN Images.
The boot workflow for management nodes (NCNs) is different from compute nodes or application nodes. They can PXE boot over the network or from local storage.
See NCN Boot Workflow for more information.
Non-compute nodes and compute nodes have different network interfaces used for booting.
For more information, see NCN Networking.
The management nodes have specific file systems and mounts and use overlayfs
.
For information, see NCN Mounts and File Systems.
The management nodes boot from images which have many (RPM) packages installed. The packages installed differ between the Kubernetes master and worker nodes versus the utility storage nodes.
For more information, see NCN Packages.
All management nodes have an operating system based on SLE_HPC
(SuSE High Performance Computing).
For more information, see NCN Operating System Releases.
cloud-init
Basecamp configurationMetal Basecamp is a cloud-init
DataSource
available on the LiveCD. Basecamp’s configuration file offers many inputs for various cloud-init
scripts embedded within the NCN images.
For more information, see cloud-init
Basecamp Configuration.