Summary: | Cannot run interactive job | ||
---|---|---|---|
Product: | Slurm | Reporter: | ssingh |
Component: | User Commands | Assignee: | Jacob Jenson <jacob> |
Status: | RESOLVED INVALID | QA Contact: | |
Severity: | 6 - No support contract | ||
Priority: | --- | CC: | ssingh |
Version: | 20.02.0 | ||
Hardware: | Linux | ||
OS: | Linux | ||
Site: | -Other- | Alineos Sites: | --- |
SNIC sites: | --- | Linux Distro: | CentOS |
Machine Name: | CLE Version: | ||
Version Fixed: | Target Release: | --- | |
DevPrio: | --- | Emory-Cloud Sites: | --- |
Description
ssingh
2020-03-25 07:19:29 MDT
I have also looked at the past reports of this issue. The firewall between the two hosts is not the cause: the interface that the compute nodes use to communicate with the head node accepts all traffic, and I get the same result with the firewall stopped.
-Sajesh-

Running strace against slurmd on the compute host shows that this might be the problem:

chown("/dev/pts/1", 1326, 7) = -1 EPERM (Operation not permitted)

I am not sure how to fix this, as I can ssh to the compute node as the user with UID 1326.
-Sajesh-

Updated to Slurm 20.02.0, but the issue still persists.
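
For anyone triaging a similar trace, here is a minimal sketch for pulling the fields out of the strace line quoted above. It is not part of Slurm; `parse_chown` and `running_as_root` are hypothetical helper names introduced only for illustration. On Linux, chown() returns EPERM when the caller lacks the privilege to change the file's owner, so one thing worth checking (an assumption, not a confirmed diagnosis of this report) is whether the traced process has root's effective UID.

```python
import errno
import os
import re

# Pattern for a chown() call as printed by strace, e.g.
#   chown("/dev/pts/1", 1326, 7) = -1 EPERM (Operation not permitted)
CHOWN_RE = re.compile(
    r'chown\("(?P<path>[^"]+)",\s*(?P<uid>\d+),\s*(?P<gid>\d+)\)'
    r'\s*=\s*(?P<ret>-?\d+)(?:\s+(?P<err>E[A-Z]+))?'
)


def parse_chown(line):
    """Hypothetical helper: return the chown fields from one strace line, or None."""
    m = CHOWN_RE.search(line)
    if m is None:
        return None
    return {
        "path": m.group("path"),
        "uid": int(m.group("uid")),
        "gid": int(m.group("gid")),
        "ret": int(m.group("ret")),
        "errno": m.group("err"),  # None when the call succeeded
    }


def running_as_root():
    """Hypothetical helper: True if this process has effective UID 0."""
    return os.geteuid() == 0


line = 'chown("/dev/pts/1", 1326, 7) = -1 EPERM (Operation not permitted)'
info = parse_chown(line)
print(info)

# Map the symbolic name back to the numeric errno and its message.
code = getattr(errno, info["errno"])
print(code, os.strerror(code))

# Reports the privilege of *this* script, not of slurmd; run it in the
# same context as the daemon if you want a comparable answer.
print("running as root:", running_as_root())
```

The parser only handles the simple `chown(path, uid, gid)` form shown in the report; strace output with timestamps or PIDs in front still matches because the pattern is searched for anywhere in the line.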