Your message dated Mon, 23 Apr 2012 02:58:51 +0100
with message-id <1335146331.4056.17.camel@deadeye>
and subject line Re: Bug#637234: linux-image-3.0.0-1-686-pae: I/O errors using
ext4 under xen (also affects ext3 as of linux-image-3.1.0-1-amd64 et al)
has caused the Debian Bug report #637234,
regarding linux-image-3.0.0-1-686-pae: I/O errors using ext4 under xen (also
affects ext3 as of linux-image-3.1.0-1-amd64 et al)
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact ow...@bugs.debian.org
immediately.)
--
637234: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=637234
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems
--- Begin Message ---
Package: linux-2.6
Version: 3.0.0-1
Severity: important
Hello,
I have a xen host running debian squeeze, amd64, some of the DomU's are
running wheezy. My mail server is a DomU called "mail", using ext4 for the
root (and other) FS. A dist-upgrade on "mail" has upgraded the kernel to
linux-image-3.0.0-1-686-pae, and at this point I started getting I/O errors
during the boot process, as follows:
-----------
Starting MySQL database server: mysqld[ 6.453894] end_request: I/O error,
dev xvda, sector 4456704
[ 6.453919] end_request: I/O error, dev xvda, sector 4456704
[ 6.453964] Aborting journal on device xvda-8.
[ 6.462873] EXT4-fs error (device xvda): ext4_journal_start_sb:296: Detected
aborted journal
[ 6.462903] EXT4-fs (xvda): Remounting filesystem read-only
[ 6.463276] journal commit I/O error
. . . . . . . . . . . . . . failed!
Starting MTA: exim4.
Starting IMAP/POP3 mail server: dovecot.
startpar: service(s) returned failure: mysql ... failed!
-----------
So I went ahead and installed wheezy on a brand new DomU, and this
was repeated immediately when booting the machine after the installation
completed.
-----------
Starting NFS common utilities: statd[ 3.977392] end_request: I/O error, dev
xvda, sector 4456808
[ 3.977415] end_request: I/O error, dev xvda, sector 4456808
[ 3.977470] Aborting journal on device xvda-8.
[ 3.990442] journal commit I/O error
[ 3.991041] EXT4-fs error (device xvda): ext4_journal_start_sb:296: Detected
aborted journal
[ 3.991126] EXT4-fs (xvda): Remounting filesystem read-only
failed!
Cleaning up temporary files....
Setting up console font and keymap...done.
startpar: service(s) returned failure: nfs-common ... failed!
INIT: Entering runlevel: 2
Using makefile-style concurrent boot in runlevel 2.
Starting rpcbind daemon...Already running..
Starting NFS common utilities: statd failed!
touch: cannot touch `/var/log/dmesg.new': Read-only file system
chown: cannot access `/var/log/dmesg.new': No such file or directory
chmod: cannot access `/var/log/dmesg.new': No such file or directory
ln: creating hard link `/var/log//dmesg.0': Read-only file system
... etc. ...
-----------
Now, it happenes this way exactly every _other_ time the machines boot.
When I reboot after these I/O errors, fsck is run and then the machine
seems to be actually fine until the next reboot when it all happens
again.
For me, this is happening on xen DomU's, only when running linux
3.0.0-1-686-pae, only when using ext4 for the root FS.
No problems when booting back to 2.6.39-2-686-pae.
Please let me know what more specific testing needs to be done, if
necessary I can test more platforms / flavors.
I have observed nothing to suggest this is related to xen, it's just my
platform here.
-- Package-specific info:
** Version:
Linux version 3.0.0-1-686-pae (Debian 3.0.0-1) (b...@decadent.org.uk) (gcc
version 4.5.3 (Debian 4.5.3-3) ) #1 SMP Sun Jul 24 14:27:32 UTC 2011
** Command line:
root=UUID=8a1a7bca-b0e2-4714-baf1-b852eab25843 ro quiet
** Not tainted
** Kernel log:
[ 0.016117] PCI: System does not support PCI
[ 0.016120] PCI: System does not support PCI
[ 0.016231] Switching to clocksource xen
[ 0.017739] pnp: PnP ACPI: disabled
[ 0.017742] PnPBIOS: Disabled
[ 0.018820] Switched to NOHz mode on CPU #1
[ 0.018902] Switched to NOHz mode on CPU #0
[ 0.020460] PCI: max bus depth: 0 pci_try_num: 1
[ 0.020696] NET: Registered protocol family 2
[ 0.020967] IP route cache hash table entries: 8192 (order: 3, 32768 bytes)
[ 0.021437] TCP established hash table entries: 32768 (order: 6, 262144
bytes)
[ 0.021752] TCP bind hash table entries: 32768 (order: 6, 262144 bytes)
[ 0.022063] TCP: Hash tables configured (established 32768 bind 32768)
[ 0.022069] TCP reno registered
[ 0.022077] UDP hash table entries: 512 (order: 2, 16384 bytes)
[ 0.022100] UDP-Lite hash table entries: 512 (order: 2, 16384 bytes)
[ 0.022469] NET: Registered protocol family 1
[ 0.022486] PCI: CLS 0 bytes, default 64
[ 0.022574] Unpacking initramfs...
[ 0.042069] Freeing initrd memory: 22480k freed
[ 0.046257] platform rtc_cmos: registered platform RTC device (no PNP device
found)
[ 0.046605] audit: initializing netlink socket (disabled)
[ 0.046616] type=2000 audit(1312911347.921:1): initialized
[ 0.056740] HugeTLB registered 2 MB page size, pre-allocated 0 pages
[ 0.057039] VFS: Disk quotas dquot_6.5.2
[ 0.057099] Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
[ 0.057194] msgmni has been set to 999
[ 0.057354] alg: No test for stdrng (krng)
[ 0.057382] Block layer SCSI generic (bsg) driver version 0.4 loaded (major
253)
[ 0.057386] io scheduler noop registered
[ 0.057388] io scheduler deadline registered
[ 0.057402] io scheduler cfq registered (default)
[ 0.057598] isapnp: Scanning for PnP cards...
[ 0.409558] isapnp: No Plug & Play device found
[ 0.409873] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
[ 0.412773] Linux agpgart interface v0.103
[ 0.413203] i8042: PNP: No PS/2 controller found. Probing ports directly.
[ 0.414033] i8042: No controller found
[ 0.414227] mousedev: PS/2 mouse device common for all mice
[ 0.454109] rtc_cmos rtc_cmos: rtc core: registered rtc_cmos as rtc0
[ 0.454143] rtc_cmos: probe of rtc_cmos failed with error -38
[ 0.454162] cpuidle: using governor ladder
[ 0.454164] cpuidle: using governor menu
[ 0.454336] TCP cubic registered
[ 0.454455] NET: Registered protocol family 10
[ 0.454980] Mobile IPv6
[ 0.454983] NET: Registered protocol family 17
[ 0.454987] Registering the dns_resolver key type
[ 0.455001] Using IPI No-Shortcut mode
[ 0.455069] PM: Hibernation image not present or could not be loaded.
[ 0.455080] registered taskstats version 1
[ 0.455093] XENBUS: Device with no driver: device/vbd/51712
[ 0.455095] XENBUS: Device with no driver: device/vbd/51744
[ 0.455097] XENBUS: Device with no driver: device/vif/0
[ 0.455099] XENBUS: Device with no driver: device/vif/1
[ 0.455102] XENBUS: Device with no driver: device/console/0
[ 0.455114]
/build/buildd-linux-2.6_3.0.0-1-i386-ML66CU/linux-2.6-3.0.0/debian/build/source_i386_none/drivers/rtc/hctosys.c:
unable to open rtc device (rtc0)
[ 0.455175] Initializing network drop monitor service
[ 0.455438] Freeing unused kernel memory: 404k freed
[ 0.456030] Write protecting the kernel text: 2768k
[ 0.456248] Write protecting the kernel read-only data: 1068k
[ 0.456250] NX-protecting the kernel data: 3376k
[ 0.490525] udevd[50]: starting version 172
[ 0.510452] Initialising Xen virtual ethernet driver.
[ 0.526964] blkfront: xvda: barrier: enabled
[ 0.528495] xvda:
[ 0.528633] Setting capacity to 10485760
[ 0.528637] xvda: detected capacity change from 0 to 5368709120
[ 0.529412] blkfront: xvdc: barrier: enabled
[ 0.558774] xvdc: unknown partition table
[ 0.559489] Setting capacity to 1048576
[ 0.559502] xvdc: detected capacity change from 0 to 536870912
[ 0.973128] PM: Starting manual resume from disk
[ 0.973131] PM: Hibernation image partition 202:32 present
[ 0.973133] PM: Looking for hibernation image.
[ 0.973405] PM: Image not found (code -22)
[ 0.973408] PM: Hibernation image not present or could not be loaded.
[ 0.983577] EXT4-fs (xvda): INFO: recovery required on readonly filesystem
[ 0.983581] EXT4-fs (xvda): write access will be enabled during recovery
[ 1.024513] EXT4-fs warning (device xvda): ext4_clear_journal_err:4155:
Filesystem error recorded from previous mount: IO failure
[ 1.024524] EXT4-fs warning (device xvda): ext4_clear_journal_err:4156:
Marking fs in need of filesystem check.
[ 1.025790] EXT4-fs (xvda): recovery complete
[ 1.026596] EXT4-fs (xvda): mounted filesystem with ordered data mode. Opts:
(null)
[ 1.928491] udevd[160]: starting version 172
[ 2.124852] input: PC Speaker as /devices/platform/pcspkr/input/input0
[ 2.204922] Error: Driver 'pcspkr' is already registered, aborting...
[ 2.550476] Adding 524284k swap on /dev/xvdc. Priority:-1 extents:1
across:524284k SS
[ 2.564932] EXT4-fs (xvda): re-mounted. Opts: (null)
[ 3.156251] blkfront: barrier: empty write xvda op failed
[ 3.156255] blkfront: xvda: barrier or flush: disabled
[ 3.185628] EXT4-fs (xvda): re-mounted. Opts: errors=remount-ro
[ 3.251006] loop: module loaded
[ 4.326336] RPC: Registered named UNIX socket transport module.
[ 4.326344] RPC: Registered udp transport module.
[ 4.326350] RPC: Registered tcp transport module.
[ 4.326356] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 4.361714] FS-Cache: Loaded
[ 4.382614] FS-Cache: Netfs 'nfs' registered for caching
[ 4.402479] Installing knfsd (copyright (C) 1996 o...@monad.swb.de).
[ 14.460105] eth0: no IPv6 routers present
** Model information
not available
** Loaded modules:
Module Size Used by
nfsd 197933 2
nfs 218404 0
lockd 61314 2 nfsd,nfs
fscache 31952 1 nfs
auth_rpcgss 32183 2 nfsd,nfs
nfs_acl 12463 2 nfsd,nfs
sunrpc 139050 6 nfsd,nfs,lockd,auth_rpcgss,nfs_acl
loop 17866 0
evdev 12995 0
snd_pcm 53315 0
snd_timer 22027 1 snd_pcm
snd 38562 2 snd_pcm,snd_timer
soundcore 12992 1 snd
snd_page_alloc 12899 1 snd_pcm
pcspkr 12515 0
ext4 274801 1
mbcache 12898 1 ext4
jbd2 56798 1 ext4
crc16 12327 1 ext4
xen_netfront 21670 0
xen_blkfront 17215 2
** PCI devices:
** USB devices:
not available
-- System Information:
Debian Release: wheezy/sid
APT prefers testing
APT policy: (500, 'testing')
Architecture: i386 (i686)
Kernel: Linux 3.0.0-1-686-pae (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages linux-image-3.0.0-1-686-pae depends on:
ii debconf [debconf-2.0] 1.5.40 Debian configuration management sy
ii initramfs-tools [linux-initra 0.99 tools for generating an initramfs
ii linux-base 3.3 Linux image base package
ii module-init-tools 3.16-1 tools for managing Linux kernel mo
Versions of packages linux-image-3.0.0-1-686-pae recommends:
pn firmware-linux-free <none> (no description available)
ii libc6-i686 2.13-10 Embedded GNU C Library: Shared lib
Versions of packages linux-image-3.0.0-1-686-pae suggests:
ii grub-pc 1.99-9 GRand Unified Bootloader, version
pn linux-doc-3.0.0 <none> (no description available)
Versions of packages linux-image-3.0.0-1-686-pae is related to:
pn firmware-bnx2 <none> (no description available)
pn firmware-bnx2x <none> (no description available)
pn firmware-ipw2x00 <none> (no description available)
pn firmware-ivtv <none> (no description available)
pn firmware-iwlwifi <none> (no description available)
pn firmware-linux <none> (no description available)
pn firmware-linux-nonfree <none> (no description available)
pn firmware-qlogic <none> (no description available)
pn firmware-ralink <none> (no description available)
pn xen-hypervisor <none> (no description available)
-- debconf information:
linux-image-3.0.0-1-686-pae/prerm/removing-running-kernel-3.0.0-1-686-pae:
true
linux-image-3.0.0-1-686-pae/postinst/ignoring-ramdisk:
linux-image-3.0.0-1-686-pae/postinst/missing-firmware-3.0.0-1-686-pae:
linux-image-3.0.0-1-686-pae/postinst/depmod-error-initrd-3.0.0-1-686-pae:
false
--- End Message ---
--- Begin Message ---
Version: 2.6.32-41
On Sun, 2012-03-04 at 12:18 -0500, Gedalya wrote:
> notfound 637234 3.2.6-1
> notfound 637234 2.6.32-41
> thanks
This didn't do what you intended; 'notfound' only reverses 'found'. The
right command is 'fixed' or 'close'.
> Timo & all,
>
> This has been a bug in the dom0 kernel. If I've been following this
> bug correctly, then the only thing about the 3.0+ kernels is that they
> have barriers enabled by default, which we then disable as a
> workaround. But this is supposed to work, and it was mishandled by the
> kernel in dom0. I therefore see no point in marking this bug as found
> in 3.0+ kernels used in domU's.
>
> This bug was apparently fixed in 2.6.32-40 or 2.6.32-41, I waited for
> 2.6.32-41 and now I'm running that kernel in my dom0. You have to
> upgrade your dom0 to the latest kernel. I now have various wheezy
> domU's with barriers enabled again, running with no issues.
Therefore closing this properly.
Ben.
--
Ben Hutchings
For every action, there is an equal and opposite criticism. - Harrison
signature.asc
Description: This is a digitally signed message part
--- End Message ---