** Description changed:

+ [Impact]
+ 
+ The bug is about simplestreams possibly getting stuck waiting forever
+ for an HTTP response that never comes, e.g. because of networking
+ issues. This can potentially affect any package depending on
+ simplestreams, but specifically it was reported affecting MAAS, where it
+ causes server deployments to time out.
+ 
+ [Test Plan]
+ 
+ Ideally this should be tested by building a MAAS snap with the
+ simplestreams package including the fix, verifying that it works as
+ expected.
+ 
+ [Regression Potential]
+ 
+ Very little. Scenarios where it takes more than 10s for a remote server
+ to provide simplestreams with the data it requested are unlikely, but
+ can't be fully excluded.
+ 
+ [Original Description]
+ 
  = How to determine you are seeing this problem =
  
  Does your MAAS server seem to get "hung up", where deployments suddenly
  start failing with lots of connection timeouts to the MAAS server?
  
  Get a list of pids of your regiond processes:
  $ ps -ef | grep regiond
  
  Run strace on each one to see if one is stuck in a connect() or recv()
  call:
  $ sudo strace -p $pid
  recv(...
  
  (normally you should see a lot of epoll_ctl() calls go by if not hung)
  
  If one is hung, use lsof to see what it is connected to:
  $ sudo lsof -i -a -p $pid
  
  If you see an open connection to your images server, then this may be
  your problem.
  
  Running sudo kill -9 on the hung pid will cause it to respawn and
  recover.
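
The [Impact] and [Regression Potential] sections above describe a client that blocks forever on an HTTP response, and a fix that bounds the wait at 10s. The following is a minimal sketch of that general technique using only the Python standard library. It is not the simplestreams patch itself: the function name `fetch` and the constant `DEFAULT_TIMEOUT` are hypothetical, and the 10s value is taken from the description above. Note that with `urllib` the timeout applies to each blocking socket operation rather than to the request as a whole.

```python
import socket
import urllib.error
import urllib.request

# Hypothetical constant; 10s mirrors the figure quoted in the description.
DEFAULT_TIMEOUT = 10.0


def fetch(url, timeout=DEFAULT_TIMEOUT):
    """Return the response body, raising TimeoutError if the server stalls.

    Without a timeout, urlopen() can block indefinitely in connect() or
    recv() when the remote end never answers.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.read()
    except socket.timeout as exc:
        # Connection was established but no response arrived in time.
        raise TimeoutError(f"no response from {url} within {timeout}s") from exc
    except urllib.error.URLError as exc:
        # Timeouts while connecting or sending are wrapped in URLError.
        if isinstance(exc.reason, socket.timeout):
            raise TimeoutError(f"no response from {url} within {timeout}s") from exc
        raise
```

A caller would then handle `TimeoutError` (for example by retrying or logging) instead of hanging until the process is killed manually.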
-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1908452

Title:
  MAAS stops working and deployment fails after `Loading ephemeral` step

To manage notifications about this bug go to:
https://bugs.launchpad.net/maas/+bug/1908452/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs