** Description changed:

+ [Impact]
+ 
+ The bug is about simplestreams possibly getting stuck waiting forever
+ for an HTTP response that never comes, e.g. because of networking
+ issues. This can potentially affect any package depending on
+ simplestreams, but specifically it was reported affecting MAAS, where it
+ causes server deployments to time out.
+ 
+ [Test Plan]
+ 
+ Ideally this should be tested by building a MAAS snap with the
+ simplestreams package including the fix, and verifying that it works as
+ expected.
+ 
+ [Regression Potential]
+ 
+ Very little. Scenarios where it takes more than 10s for a remote server
+ to provide simplestreams with the data it requested are unlikely, but
+ can't be fully excluded.
+ 
+ [Original Description]
+ 
  = How to determine you are seeing this problem =
  Does your MAAS server seem to get "hung up", where deployments suddenly start
  failing w/ lots of connection timeouts to the MAAS server?
  
  Get a list of pids of your regiond processes:
  $ ps -ef | grep regiond
  
  Run strace on each one to see if one is stuck in a connect() or recv() call:
  $ sudo strace -p $pid
  recv(...
  
  (normally you should see a lot of epoll_ctl() calls go by if not hung)
  
  If one is hung, use lsof to see what it is connected to:
  $ sudo lsof -i -a -p $pid
  
  If you see an open connection to your images server, then this may be
  your problem. Running sudo kill -9 on the hung pid will cause it to
  respawn and recover.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1908452

Title:
  MAAS stops working and deployment fails after `Loading ephemeral` step

To manage notifications about this bug go to:
https://bugs.launchpad.net/maas/+bug/1908452/+subscriptions
