Troubleshooting Installation Problems

Installing the Cray System Management (CSM) product requires knowledge of the nodes and switches in the HPE Cray EX system. Refer to the procedures in this section during the CSM install for additional information on system hardware, troubleshooting, and administrative tasks related to CSM.

Topics

Reset root password on a LiveCD USB

If the root password on the LiveCD needs to be changed, then see Reset root Password on a LiveCD USB.

PXE boot troubleshooting

See Troubleshooting PXE Boot.

Restart network services and interfaces on NCNs

If an NCN shows any of these problems, then the network services and interfaces on that node might need to be restarted:

  • Interfaces not showing up
  • IP addresses not applying
  • Member/child interfaces not being included

See Restart network services and interfaces on NCNs.
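
Before restarting anything, it can help to confirm which of the symptoms above is actually present. The following is a minimal diagnostic sketch, not part of the CSM procedures. It assumes the iproute2 `ip` command on the NCN supports JSON output (`ip -j addr show`). The helper names and the interface names in the usage note are hypothetical.

```python
import json
import subprocess


def load_ip_addr_json():
    """Return the parsed output of `ip -j addr show` (iproute2 JSON mode)."""
    out = subprocess.run(
        ["ip", "-j", "addr", "show"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)


def summarize_interfaces(ip_json):
    """Reduce each interface entry to name, state, IPv4 addresses, and parent."""
    summary = []
    for entry in ip_json:
        # Collect only IPv4 addresses; "family" is "inet" for IPv4 entries.
        ipv4 = [
            addr["local"]
            for addr in entry.get("addr_info", [])
            if addr.get("family") == "inet" and "local" in addr
        ]
        summary.append({
            "name": entry.get("ifname"),
            "state": entry.get("operstate"),
            "ipv4": ipv4,
            # "master" is only present when the interface is a member of
            # a bond or bridge; otherwise this is None.
            "master": entry.get("master"),
        })
    return summary


def missing_interfaces(summary, expected_names):
    """Return expected interface names that are absent from the summary."""
    present = {item["name"] for item in summary}
    return sorted(set(expected_names) - present)
```

For example, `missing_interfaces(summarize_interfaces(load_ip_addr_json()), ["bond0", "mgmt0", "mgmt1"])` lists any of the named interfaces that are not showing up, where the three names are placeholders for whatever the node is expected to have. An empty `ipv4` list points to an address that did not apply, and a `master` of `None` on an expected member interface points to a member that was not included.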

Utility storage node installation troubleshooting

Occasionally, a single large OSD is created that concatenates multiple devices, instead of one OSD being created per device. In this case, the Ceph storage might need to be reinitialized.

See Troubleshooting Utility Storage Node Installation.
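
One rough way to spot a concatenated OSD is to compare OSD sizes and look for an outlier. The sketch below is an illustration only. It assumes that `ceph osd df --format json` returns a `nodes` list whose entries carry `name` and `kb` (size in KiB) fields, and that most OSDs on the system are correctly sized, so the median is a fair baseline. The function names and the `factor` threshold are hypothetical.

```python
import json
import statistics
import subprocess


def load_osd_df_json():
    """Return the parsed output of `ceph osd df --format json`."""
    out = subprocess.run(
        ["ceph", "osd", "df", "--format", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)


def oversized_osds(osd_df, factor=1.5):
    """Return names of OSDs whose size exceeds `factor` times the median size.

    Assumption: each entry in osd_df["nodes"] has "name" and "kb" fields.
    Entries with a missing or zero size are ignored.
    """
    nodes = [node for node in osd_df.get("nodes", []) if node.get("kb")]
    if not nodes:
        return []
    median_kb = statistics.median(node["kb"] for node in nodes)
    return [node["name"] for node in nodes if node["kb"] > factor * median_kb]
```

Calling `oversized_osds(load_osd_df_json())` on a storage node returns the suspect OSD names, or an empty list if all OSDs are of similar size. The heuristic does not work if most OSDs are concatenated or if the system legitimately mixes device sizes, so treat a hit as a prompt to follow the linked procedure rather than as proof.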

Ceph CSI troubleshooting

If not all Ceph CSI components on ncn-s001 were initialized successfully, then the storage node cloud-init may need to be rerun.

See Troubleshooting Ceph CSI.

Postgres troubleshooting

  • Timeout on cray-sls-init-load during Install CSM Services due to Postgres cluster in SyncFailed state

See Postgres status SyncFailed.
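
To check whether a Postgres cluster is in the SyncFailed state, the cluster resources can be listed and filtered by status. The sketch below is a hedged illustration. It assumes the Postgres clusters are exposed as `postgresql` custom resources that can be listed with `kubectl get postgresql -A -o json`, and that each resource reports its state under `status.PostgresClusterStatus`. The function names are hypothetical.

```python
import json
import subprocess


def load_postgresql_list():
    """Return the parsed output of `kubectl get postgresql -A -o json`."""
    out = subprocess.run(
        ["kubectl", "get", "postgresql", "-A", "-o", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)


def unhealthy_postgres_clusters(postgresql_list, healthy_status="Running"):
    """Return (namespace, name, status) for clusters not in the healthy state.

    Assumption: the state is stored under status.PostgresClusterStatus.
    A cluster with no status at all is reported with a status of None.
    """
    unhealthy = []
    for item in postgresql_list.get("items", []):
        meta = item.get("metadata", {})
        status = (item.get("status") or {}).get("PostgresClusterStatus")
        if status != healthy_status:
            unhealthy.append((meta.get("namespace"), meta.get("name"), status))
    return unhealthy
```

Running `unhealthy_postgres_clusters(load_postgresql_list())` from a node with `kubectl` access returns one tuple per cluster that is not `Running`. A tuple whose status is `SyncFailed` matches the condition described above.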