The following covers restoring Slurm data.
To restore Slurm data from backup, follow sections 10.3.11 Restore Slurm Accounting Database from Backup and 10.3.13 Restore Slurm Spool Directory from Backup in the HPE Cray Programming Environment Installation Guide: CSM on HPE Cray EX Systems (S-8003).
After restoring Slurm data from backup, check that the procedure was successful.
(uan#
) Check that accounting records were successfully restored. Use a start date from before the backup was taken.
sacct -a -S <date>
(uan#
) Check that the job queue was successfully restored.
squeue
(uan#
) Check that node states were successfully restored.
sinfo
sinfo --list-reasons