I really appreciate the input everyone. I guess I was a little overwhelmed
dealing with a few different issues at once .... I didn't mean to place the
blame on MAAS. That being said, node disk replacement under the direction
of MAAS is still a rugged process for me. I understand that pxelinux/bios
may be the root cause of my issue ... I guess I feel like MAAS had more to
do with this due to MAAS not being able to recognize new disk after
replacement w/o recommissioning. I feel like despite the boot issue, I
would still need to recommission and down the node for MAAS to take
inventory of  the new disk after a replacement. Is this being looked into
for 1.9?

Thanks again,

James


On Mon, Jan 25, 2016 at 6:41 AM, Gavin Panella <gavin.pane...@canonical.com>
wrote:

> Even when a node has been deployed, the node still attempts to PXE boot
> from MAAS each time it's rebooted. MAAS knows it should boot locally and
> gives the following configuration to PXELINUX:
>
>   DEFAULT local
>
>   LABEL local
>     LOCALBOOT 0
>
> It appears that this does not do the right thing for your hardware. Put
> another way, it does not do the same thing as your machine's BIOS does
> when the network is unavailable.
>
> I suspect this is a bug in PXELINUX and/or your hardare. There may be
> something that MAAS can do to help, but I don't think it's the cause, so
> I'll target this bug at PXELINUX and mark it Invalid in MAAS for now.
>
>
> ** Also affects: syslinux (Ubuntu)
>    Importance: Undecided
>        Status: New
>
> ** Changed in: maas
>        Status: Confirmed => Invalid
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1488594
>
> Title:
>   Nodes cannot boot after a storage disk replacement
>
> Status in MAAS:
>   Invalid
> Status in syslinux package in Ubuntu:
>   New
>
> Bug description:
>   I'm experiencing this issue when I replace any osd disk on any ceph
>   storage node and then reboot it. Immediatly after the node pxe boots,
>   the node will hang at a "booting local disk" message and fails to
>   timeout or boot. A work-around I've found to get a node to boot after
>   a storage disk replacement is to momentarilly disable maas from
>   managing the network after the power on of a node who's disk has been
>   replaced; following that, after the node pxe boot times out and it
>   results to booting from local disk into the os, I re-enable maas
>   management on that network so the node gets an ip and continues the
>   boot process and eventually successfully boots.
>
>   It would be nice to get some feedback on what is going on here, and
>   also a best practice for what/how to proceed in the case when you need
>   to swap storage disks.
>
>   Thanks!
>
>   maas.log <-- http://paste.ubuntu.com/12193844/
>
>   clusterd.log <-- http://paste.ubuntu.com/12193842/
>
>   maas - 1.8.0+bzr4001-0ubuntu2~trusty1
>   trusty - 14.04.3
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/maas/+bug/1488594/+subscriptions
>

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1488594

Title:
  Nodes cannot boot after a storage disk replacement

To manage notifications about this bug go to:
https://bugs.launchpad.net/maas/+bug/1488594/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to