According to the PMIx standard, the PMIX_NODEID key must be included in the process level data.
Refs: https://pmix.github.io/uploads/2020/12/pmix-standard-v4.0.pdf

Error reproduction:
$ srun -N2 --mpi=pmix_v4 ./pmix_client -n 2
==46645== [1611566237.993365] ERROR [pmix_client.c:129:main]: rank 0: PMIx_Get nodeid failed: NOT-FOUND
srun: error: jazz29: task 0: Exited with exit code 210
srun: launch/slurm: _step_signal: Terminating StepId=2.9
srun: error: jazz30: task 1: Terminated
srun: Force Terminated StepId=2.9
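To illustrate the failure mode, here is a minimal, self-contained C sketch of a per-process key table. It is only a toy model, not the Slurm mpi/pmix plugin and not the PMIx API: every `model_*` name, the `MODEL_*` return codes, and the `"model.rank"` / `"model.nodeid"` key strings are hypothetical stand-ins chosen for this example. The `include_nodeid` flag mimics the behavior without and with the node ID published in the process-level data.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical model only: these names do not exist in Slurm or PMIx. */
#define MODEL_MAX_KEYS  8
#define MODEL_OK        0
#define MODEL_NOT_FOUND (-1)
#define MODEL_FULL      (-2)

typedef struct {
    const char *key;
    uint32_t value;
} model_kv_t;

/* Process-level data for one rank: a small fixed-size key/value table. */
typedef struct {
    size_t nkeys;
    model_kv_t kv[MODEL_MAX_KEYS];
} model_proc_t;

/* Append a key/value pair to the process entry. */
int model_put(model_proc_t *proc, const char *key, uint32_t value)
{
    if (proc->nkeys >= MODEL_MAX_KEYS)
        return MODEL_FULL;
    proc->kv[proc->nkeys].key = key;
    proc->kv[proc->nkeys].value = value;
    proc->nkeys++;
    return MODEL_OK;
}

/* Look up a key; *out is left untouched when the key is absent. */
int model_get(const model_proc_t *proc, const char *key, uint32_t *out)
{
    for (size_t i = 0; i < proc->nkeys; i++) {
        if (strcmp(proc->kv[i].key, key) == 0) {
            *out = proc->kv[i].value;
            return MODEL_OK;
        }
    }
    return MODEL_NOT_FOUND;
}

/* Build one process entry. include_nodeid == 0 models an entry that
 * lacks the node ID; include_nodeid != 0 models one that publishes it. */
void model_build_proc(model_proc_t *proc, uint32_t rank, uint32_t nodeid,
                      int include_nodeid)
{
    assert(proc != NULL);
    proc->nkeys = 0;
    model_put(proc, "model.rank", rank);
    if (include_nodeid)
        model_put(proc, "model.nodeid", nodeid);
}
```

In this model, a lookup of `"model.nodeid"` on an entry built with `include_nodeid == 0` returns `MODEL_NOT_FOUND`, which is analogous to the NOT-FOUND result the test client reports above; building the entry with `include_nodeid != 0` makes the same lookup succeed.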
Created attachment 17593
bug10684_2011.patch
(In reply to Boris Karasev from comment #1)
> Created attachment 17593
> bug10684_2011.patch

This patch depends on https://bugs.schedmd.com/show_bug.cgi?id=7263.
(In reply to Boris Karasev from comment #2)
> (In reply to Boris Karasev from comment #1)
> > Created attachment 17593
> > bug10684_2011.patch
>
> This patch depends on https://bugs.schedmd.com/show_bug.cgi?id=7263.

There is no direct dependency on 7263. The issue is simply reproduced with the PMIx v4 test suite, support for which was added in 7263.
Comment on attachment 17593
bug10684_2011.patch

commit f950cc9831e2c808e7b25e057950ab8e7e121778
Author:     Boris Karasev <karasev.b@gmail.com>
AuthorDate: Fri Jan 22 06:45:54 2021 +0200

    mpi/pmix: include PMIX_NODEID for each process entry.

    According to the PMIx standard, the PMIX_NODEID key must be included in
    the process level data.
    Refs: https://pmix.github.io/uploads/2020/12/pmix-standard-v4.0.pdf

    Bug 10684.
Thanks Boris. Committed ahead of 20.11.4.