On Sat, Oct 19, 2013 at 8:25 PM, Mark Knecht <markkne...@gmail.com> wrote: > OK, it's a good idea just to have a Konsole terminal open. That might > catch something.
I'm not sure if panics show up in konsole. With a virtual console the kernel actually outputs the message. Konsole under X11 is entirely user-mode and I'm not sure that ANY user-mode code can ever run after a panic. I think a virtual console is a better bet. > OK, so I remember years ago debugging something for Ingo Molnar using > the serial console, but in those days it was a real serial console on > a real serial port. None of my machine have those ports anymore. There > must be a more modern version of doing that. I'll go look for info. > Ethernet? USB? We've recently moved and the only other machine I've > got here at the apartment is a Gentoo laptop. That you'd have to look into. I'm not sure if the kernel can handle a serial console on a PL2302/etc. It might - it is all kernel-mode I think. You'd have to attach it to another device running a terminal emulator, assuming you don't have a vt100/etc lying around. > There's a gentoo.wiki.org page here: > > http://wiki.gentoo.org/wiki/Kernel_Crash_Dumps > > The setup looks reasonably straight forward so I've reconfigured > 3.10.17 following those instructions. Yeah, I forgot - that was actually started based on my blog entry, actually. It may very well have been improved on since. > > One question for now. In the Kernel Hacking section there's an option > for "Detect Hard and Soft Lockups" which on the surface looks like a > good thing to turn on but it's not mentioned in these instructions. Probably not a bad idea. > When turned on it has options for Panic (Reboot) for both types. Seems > like I probably want that all turned on? You could try setting it to no and see if you actually can capture any meaningful logs that way - there is a chance you could recover your system without rebooting. However, a panic would be the only real sure way to ensure a dump. Oh, and don't forget that there is a magic sysrq that triggers a panic. Only issue with that is that you'll have to hunt around for whatever caused the actual hangup because it won't be in the panic backtrace (that will just lead you to the sysrq code). > As I expected about the logs. If the machine's dead then I don't want > stuff getting written to disk anyway. kdump sounds like the best > solution going right now. I'll try and see if I can get it working. Yeah - one of these days I'll see if I can get kdump working again. What it really needs is an initramfs that will automatically capture the dump and reboot. That's how other distros handle it. The dumps are pretty big though - the size of your RAM. If you get a dump there are a bunch of tools that can be used to analyze it. Rich