Hi team, We just had an incident caused by a config issue. We were rolling out an updated Slurm config to all of our infrastructure and reloaded the daemons. Due to a syntax error, we lost the config on all nodes and Slurm was unusable. While normal compute jobs continued to run, all VDI sessions died, which was a major impact. Would it be possible to add config-check functionality to slurm*d, similar to apachectl configtest or sshd -t? This could be run for validation before actually reloading the daemons. Thanks, Justin
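To illustrate the validate-before-reload workflow requested above, here is a minimal sketch of a wrapper that only triggers a reload when a check command exits with status 0. The function name reload_if_valid is hypothetical, and the sshd example in the comment assumes a systemd unit named sshd (the unit name varies by distribution). Slurm is deliberately not used in the example, since the ticket is asking for exactly such a check mode.

```python
import subprocess
import sys


def reload_if_valid(check_cmd, reload_cmd):
    """Run check_cmd; run reload_cmd only if the check exits with status 0.

    Both arguments are argument lists as accepted by subprocess.run.
    Returns True if the reload was performed, False if the check failed.
    """
    check = subprocess.run(check_cmd, capture_output=True, text=True)
    if check.returncode != 0:
        # Leave the running daemon untouched and report why.
        print("config check failed, not reloading:", file=sys.stderr)
        print(check.stderr, file=sys.stderr)
        return False
    # The check passed; a failing reload raises CalledProcessError.
    subprocess.run(reload_cmd, check=True)
    return True


# Example with a daemon that already offers a test mode (assumed unit name):
#   reload_if_valid(["sshd", "-t"], ["systemctl", "reload", "sshd"])
```

With a pattern like this, a syntax error stops the rollout on the first node before any running daemon picks up the broken file.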
Hi team, Any news on this? Thanks, Justin
Unfortunately, because some of our configuration validation is delegated to the plugins, this isn't as simple as it would seem. We've discussed this and have had an internal enhancement request open for it for a while. I'm closing this as a duplicate of the (now-public) bug 3445, which is tracking this issue, but I cannot commit at this time to when or if we'll have this completed. - Tim *** This ticket has been marked as a duplicate of ticket 3445 ***