On 10/3/20 1:40 pm, mike tie wrote:

Here is the output of lstopo

Hmm, well I believe Slurm should be using hwloc (which provides lstopo) to get its information (at least it calls the xcpuinfo_hwloc_topo_get() function for that), so if lstopo works then slurmd should too.

Ah, looking a bit deeper I see in src/slurmd/common/xcpuinfo.c:

        if (!hwloc_xml_whole)
                hwloc_xml_whole = xstrdup_printf("%s/hwloc_topo_whole.xml",
                                                 conf->spooldir);

Do you happen to have a file called "hwloc_topo_whole.xml" in your spool directory on that node? I'm wondering if an old configuration is cached there.

If so, move it out of the way somewhere safe (just in case) and try again.
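
In case it helps, here is a rough sketch of that check-and-move step in Python. Only the file name comes from the source snippet above; the function name is made up for illustration, and the spool directory is whatever SlurmdSpoolDir is set to in your slurm.conf (the path in the usage note is just an assumed example). The file is renamed with a timestamp rather than deleted, so it can be put back if it turns out not to be the culprit.

```python
import os
import time
from typing import Optional

# File name taken from the xcpuinfo.c snippet quoted above.
CACHE_NAME = "hwloc_topo_whole.xml"


def set_aside_cached_topology(spool_dir: str) -> Optional[str]:
    """Rename the cached hwloc topology file in spool_dir, if present.

    Hypothetical helper, not part of Slurm. Returns the path of the
    renamed file, or None when no cached file exists in spool_dir.
    """
    cached = os.path.join(spool_dir, CACHE_NAME)
    if not os.path.isfile(cached):
        return None  # nothing cached here, so nothing to move

    # Keep the old file next to the original, tagged with a timestamp.
    backup = "{}.{}.bak".format(cached, time.strftime("%Y%m%d-%H%M%S"))
    os.rename(cached, backup)
    return backup
```

You would call it as, say, set_aside_cached_topology("/var/spool/slurmd") as a user with write access to that directory, substituting your own spool directory for the assumed path.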

All the best,
Chris
--
 Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA
