Summary: | Recompiled plugins appear non-functional under Slurm 20.02 | ||
---|---|---|---|
Product: | Slurm | Reporter: | Chris Samuel (NERSC) <csamuel> |
Component: | Other | Assignee: | Marshall Garey <marshall> |
Status: | RESOLVED DUPLICATE | QA Contact: | |
Severity: | 3 - Medium Impact | ||
Priority: | --- | CC: | agaur, bart, dmjacobsen, marshall |
Version: | 20.02.3 | ||
Hardware: | Linux | ||
OS: | Linux | ||
See Also: | https://bugs.schedmd.com/show_bug.cgi?id=9160 | ||
Site: | NERSC | ||
Description
Chris Samuel (NERSC)
2020-06-08 12:07:38 MDT
Hi there,

Is it possible that the work done in https://bugs.schedmd.com/show_bug.cgi?id=7286 has caused slurm_spank_job_prolog() to suddenly no longer work?

Tracking through the SDN code and running slurmd at debug3, it looks like it's failing because a file that should be created by slurm_spank_job_prolog() does not exist, and Aditi and I suspect that the function is not being called. Is that possible?

All the best,
Chris

Hi, Chris, this looks like it may be a duplicate of bug #9081, comment #5. As Dominik noted there:

>I think I found the source of this regression.
>You need to directly set PlugStackConfig in slurm.conf.
>Let me know if this helps.

We are still looking into fixing this in bug #9081. Would you try the above workaround and let us know if it changes the behavior for you, so that we can confirm the issue?

Hi Jason,

Thanks so much for that pointer, we'll try that out today. I suspect this is also what my friend Laszlo back in Melbourne is seeing too: https://bugs.schedmd.com/show_bug.cgi?id=9160

All the best,
Chris

>I suspect this is also what my friend Laszlo back in Melbourne is seeing too:

We have linked those two issues.

There is discussion internally about this, and this does seem like a duplicate of bug #9160. As I did for 9160, I'm closing this as a duplicate of bug 9081. Please re-open it if setting PlugStackConfig in slurm.conf doesn't work.

*** This ticket has been marked as a duplicate of ticket 9081 ***
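
For reference, the workaround quoted above (setting PlugStackConfig explicitly in slurm.conf) might look like the fragment below. This is only a sketch: the path is an assumption chosen for illustration and should be replaced with wherever the site's plugstack.conf actually lives. The thread does not say which path NERSC used.

```
# slurm.conf (fragment)
# Point Slurm at the SPANK plugin stack explicitly instead of relying on the default lookup.
# NOTE: the path below is a placeholder assumption, not taken from this ticket.
PlugStackConfig=/etc/slurm/plugstack.conf
```

After a change like this, the Slurm daemons would typically need to re-read their configuration before the SPANK plugins, including any slurm_spank_job_prolog() hook, can be expected to load.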