The following covers restoring Slurm data.
To restore Slurm data from a backup, see Restore Slurm accounting database from a backup and Restore Slurm spool directory from a backup in the HPE Cray Supercomputing User Services Software Administration Guide: CSM on HPE Cray EX Systems (S-8063).
After restoring Slurm data from backup, check that the procedure was successful.
(uan#
) Check that accounting records were successfully restored. Use a start date from before the backup was taken.
sacct -a -S <date>
(uan#
) Check that the job queue was successfully restored.
squeue
(uan#
) Check that node states were successfully restored.
sinfo
sinfo --list-reasons