Public bug reported:

I am experiencing long test runs on bionic and later series, on a NUMA
machines (that is, on machines that have more than one CPU on the
motherboard).

I discovered this while I was running MAAS stress-ng-cpu tests which
started timing out, on NUMA machines, when I changed commissioning
series to bionic (this is related bug: LP: #1826789).

I was running stress-ng as MAAS runs it, like this:

stress-ng --aggressive -a 0 --class cpu,cpu-cache --ignite-cpu -log-
brief --metrics-brief --times --tz --verify --timeout 1h

Maas actually runs the test for 12 hours, but the issue is visible even
with shorter test runs.

The above command on xenial, regardless of NUMA/non-NUMA machine,
completes within 60 minutes.

But on bionic and later series, on NUMA machines, the tests takes up to 70 
minutes, and on disco (with kernel 5) it takes cca 88 minutes for the test to 
complete.
On non-NUMA machines (that is, single CPU on the motherboard), the tests 
complete within 60 minutes, add 20-30 seconds on that, regardless of the series.

I was thinking that this might be normal behavior on bionic and later
kernels (4.15 and above), but then I installed Centos7 on the test NUMA
machine, and upgraded kernel to v5.0, and compiled latest version of
stress-ng (same as on disco) - the test completed within 60 minutes,
same as on xenial.

** Affects: stress-ng (Ubuntu)
     Importance: Undecided
         Status: New

** Description changed:

  I am experiencing long test runs on bionic and later series, on a NUMA
  machines (that is, on machines that have more than one CPU on the
  motherboard).
  
  I discovered this while I was running MAAS stress-ng-cpu tests which
  started timing out, on NUMA machines, when I changed commissioning
- series to bionic (this is related bug: LP #1826789).
+ series to bionic (this is related bug: LP: #1826789).
  
  I was running stress-ng as MAAS runs it, like this:
  
  stress-ng --aggressive -a 0 --class cpu,cpu-cache --ignite-cpu -log-
  brief --metrics-brief --times --tz --verify --timeout 1h
  
  Maas actually runs the test for 12 hours, but the issue is visible even
  with shorter test runs.
  
  The above command on xenial, regardless of NUMA/non-NUMA machine,
  completes within 60 minutes.
  
- But on bionic and later series, on NUMA machines, the tests takes up to 70 
minutes, and on disco (with kernel 5) it takes cca 88 minutes for the test to 
complete. 
+ But on bionic and later series, on NUMA machines, the tests takes up to 70 
minutes, and on disco (with kernel 5) it takes cca 88 minutes for the test to 
complete.
  On non-NUMA machines (that is, single CPU on the motherboard), the tests 
complete within 60 minutes, add 20-30 seconds on that, regardless of the series.
  
  I was thinking that this might be normal behavior on bionic and later
  kernels (4.15 and above), but then I installed Centos7 on the test NUMA
  machine, and upgraded kernel to v5.0, and compiled latest version of
  stress-ng (same as on disco) - the test completed within 60 minutes,
  same as on xenial.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1826791

Title:
  stress-ng cpu tests run longer than configured timeout

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/stress-ng/+bug/1826791/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to