** Description changed:

  Background:
  This is issue occurs during an automated process and occurs with a 1/20 
chance per iteration
  I have one lxc-container on the machine
  It is backed with an lvm2 snapshot
  Running on ubuntu 12.10 on ec2 small instance - upgraded from 12.04 fresh 
instance
  This is a new issue that has occurred after migrating my code from 11.10
  
  Process:
  create snapshot "lvcreate"
  mount snapshot "mount"
  lxc-start
  do actions in container
  lxc-stop
  unmount snapshot "umount"
  remove snapshot "lvremove"
  -repeat
  
  The issue can occur at either lxc-stop or lvremove.
  
  when it occurs with lxc-stop:
  ps -A reveals that lxc-start is still running along with kdmflush, kjournald, 
and init that appears to be the init process for the container
  kdmflush, kjournald, init or it's sub-processes cannot be killed with "kill 
-9 pid" but lxc-start can
  
  when it occurs with lvremove it occurs after lvremove is called again after 
failing the first time with stderr:
  Using logical volume(s) on command line
-     Archiving volume group "vmg1" metadata (seqno 272).
-     Removing snapshot snap
-     Found volume group "vmg1"
-     Found volume group "vmg1"
-     Loading vmg1-vm table (252:0)
-     Loading vmg1-snap table (252:1)
-   /sbin/dmeventd: stat failed: No such file or directory
-     vmg1/snapshot0 already not monitored.
-     Suspending vmg1-vm (252:0) with device flush
-     Suspending vmg1-snap (252:1) with device flush
-     Suspending vmg1-vm-real (252:2) with device flush
-     Suspending vmg1-snap-cow (252:3) with device flush
-     Found volume group "vmg1"
-     Resuming vmg1-snap-cow (252:3)
-     Resuming vmg1-vm-real (252:2)
-     Resuming vmg1-snap (252:1)
-     Removing vmg1-snap-cow (252:3)
-   device-mapper: remove ioctl on  failed: Device or resource busy
-   Unable to deactivate vmg1-snap-cow (252:3)
-   Failed to resume snap.
-   libdevmapper exiting with 1 device(s) still suspended.
+     Archiving volume group "vmg1" metadata (seqno 272).
+     Removing snapshot snap
+     Found volume group "vmg1"
+     Found volume group "vmg1"
+     Loading vmg1-vm table (252:0)
+     Loading vmg1-snap table (252:1)
+   /sbin/dmeventd: stat failed: No such file or directory
+     vmg1/snapshot0 already not monitored.
+     Suspending vmg1-vm (252:0) with device flush
+     Suspending vmg1-snap (252:1) with device flush
+     Suspending vmg1-vm-real (252:2) with device flush
+     Suspending vmg1-snap-cow (252:3) with device flush
+     Found volume group "vmg1"
+     Resuming vmg1-snap-cow (252:3)
+     Resuming vmg1-vm-real (252:2)
+     Resuming vmg1-snap (252:1)
+     Removing vmg1-snap-cow (252:3)
+   device-mapper: remove ioctl on  failed: Device or resource busy
+   Unable to deactivate vmg1-snap-cow (252:3)
+   Failed to resume snap.
+   libdevmapper exiting with 1 device(s) still suspended.
  
  lvremove spawns the lvm process and neither can be killed with "kill -9
  pid" which indicates to me that they are waiting for something from the
  kernel, and I am guessing this happens because of the same reason why
  lxc-stop also hangs, and the containers processes can not be killed.
  
- This is all I can report for now, but I'll try getting some log info
- from lxc next Friday, let me know if you have any suggestions in the
- meantime.
+ Here is an excerpt from the syslog - lxc-stop hangs because of this
+ kernel error, and the error doesn't always occur on the cat command, it
+ can happen on others as well.
+ 
+ --START
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.406366] kjournald 
starting.  Commit interval 5 seconds
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.406929] EXT3-fs (dm-1): 
using internal journal
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.406931] EXT3-fs (dm-1): 
mounted filesystem with ordered data mode
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.435218] device 
vethyurrCc entered promiscuous mode
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.435613] IPv6: 
ADDRCONF(NETDEV_UP): vethyurrCc: link is not ready
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.534518] IPv6: 
ADDRCONF(NETDEV_CHANGE): vethyurrCc: link becomes ready
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.534543] br0: port 
1(vethyurrCc) entered forwarding state
+ Oct 30 14:43:12 domU-12-31-39-14-64-79 kernel: [ 1094.534547] br0: port 
1(vethyurrCc) entered forwarding state
+ 
+ --STOP
+ Oct 30 14:43:17 domU-12-31-39-14-64-79 kernel: [ 1099.112881] br0: port 
1(vethyurrCc) entered disabled state
+ Oct 30 14:43:17 domU-12-31-39-14-64-79 kernel: [ 1099.115187] device 
vethyurrCc left promiscuous mode
+ Oct 30 14:43:17 domU-12-31-39-14-64-79 kernel: [ 1099.115190] br0: port 
1(vethyurrCc) entered disabled state
+ 
+ --START
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.188337] kjournald 
starting.  Commit interval 5 seconds
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.188852] EXT3-fs (dm-1): 
using internal journal
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.188859] EXT3-fs (dm-1): 
mounted filesystem with ordered data mode
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.310142] device 
vethfsh25j entered promiscuous mode
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.310539] IPv6: 
ADDRCONF(NETDEV_UP): vethfsh25j: link is not ready
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.319210] IPv6: 
ADDRCONF(NETDEV_CHANGE): vethfsh25j: link becomes ready
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.319240] br0: port 
1(vethfsh25j) entered forwarding state
+ Oct 30 14:43:18 domU-12-31-39-14-64-79 kernel: [ 1100.319244] br0: port 
1(vethfsh25j) entered forwarding state
+ 
+ --STOP
+ Oct 30 14:43:23 domU-12-31-39-14-64-79 kernel: [ 1105.073237] br0: port 
1(vethfsh25j) entered disabled state
+ Oct 30 14:43:23 domU-12-31-39-14-64-79 kernel: [ 1105.075541] device 
vethfsh25j left promiscuous mode
+ Oct 30 14:43:23 domU-12-31-39-14-64-79 kernel: [ 1105.075544] br0: port 
1(vethfsh25j) entered disabled state
+ 
+ --START
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.091653] kjournald 
starting.  Commit interval 5 seconds
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.092173] EXT3-fs (dm-1): 
using internal journal
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.092176] EXT3-fs (dm-1): 
mounted filesystem with ordered data mode
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.119867] device 
vethYI2DWn entered promiscuous mode
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.120382] IPv6: 
ADDRCONF(NETDEV_UP): vethYI2DWn: link is not ready
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.128936] IPv6: 
ADDRCONF(NETDEV_CHANGE): vethYI2DWn: link becomes ready
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.128964] br0: port 
1(vethYI2DWn) entered forwarding state
+ Oct 30 14:43:24 domU-12-31-39-14-64-79 kernel: [ 1106.128968] br0: port 
1(vethYI2DWn) entered forwarding state
+ 
+ --STOP
+ Oct 30 14:43:28 domU-12-31-39-14-64-79 kernel: [ 1110.816859] br0: port 
1(vethYI2DWn) entered disabled state
+ Oct 30 14:43:28 domU-12-31-39-14-64-79 kernel: [ 1110.819087] device 
vethYI2DWn left promiscuous mode
+ Oct 30 14:43:28 domU-12-31-39-14-64-79 kernel: [ 1110.819090] br0: port 
1(vethYI2DWn) entered disabled state
+ 
+ --Why is this happening occasionally?
+ Oct 30 14:43:29 domU-12-31-39-14-64-79 udevd[2811]: inotify_add_watch(6, 
/dev/dm-1, 10) failed: No such file or directory
+ 
+ --START
+ Oct 30 14:43:29 domU-12-31-39-14-64-79 kernel: [ 1111.748495] kjournald 
starting.  Commit interval 5 seconds
+ Oct 30 14:43:29 domU-12-31-39-14-64-79 kernel: [ 1111.748933] EXT3-fs (dm-1): 
using internal journal
+ Oct 30 14:43:29 domU-12-31-39-14-64-79 kernel: [ 1111.748936] EXT3-fs (dm-1): 
mounted filesystem with ordered data mode
+ Oct 30 14:43:29 domU-12-31-39-14-64-79 kernel: [ 1111.868572] device 
vethSaApSo entered promiscuous mode
+ Oct 30 14:43:29 domU-12-31-39-14-64-79 kernel: [ 1111.869304] IPv6: 
ADDRCONF(NETDEV_UP): vethSaApSo: link is not ready
+ Oct 30 14:43:30 domU-12-31-39-14-64-79 kernel: [ 1111.874370] IPv6: 
ADDRCONF(NETDEV_CHANGE): vethSaApSo: link becomes ready
+ Oct 30 14:43:30 domU-12-31-39-14-64-79 kernel: [ 1111.874394] br0: port 
1(vethSaApSo) entered forwarding state
+ Oct 30 14:43:30 domU-12-31-39-14-64-79 kernel: [ 1111.874398] br0: port 
1(vethSaApSo) entered forwarding state
+ 
+ --STOP
+ Oct 30 14:43:34 domU-12-31-39-14-64-79 kernel: [ 1116.749280] br0: port 
1(vethSaApSo) entered disabled state
+ Oct 30 14:43:34 domU-12-31-39-14-64-79 kernel: [ 1116.751502] device 
vethSaApSo left promiscuous mode
+ Oct 30 14:43:34 domU-12-31-39-14-64-79 kernel: [ 1116.751505] br0: port 
1(vethSaApSo) entered disabled state
+ 
+ --START
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.774270] kjournald 
starting.  Commit interval 5 seconds
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.774709] EXT3-fs (dm-1): 
using internal journal
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.774711] EXT3-fs (dm-1): 
mounted filesystem with ordered data mode
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.803322] device 
vethC8ic4K entered promiscuous mode
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.803718] IPv6: 
ADDRCONF(NETDEV_UP): vethC8ic4K: link is not ready
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.812401] IPv6: 
ADDRCONF(NETDEV_CHANGE): vethC8ic4K: link becomes ready
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.812458] br0: port 
1(vethC8ic4K) entered forwarding state
+ Oct 30 14:43:36 domU-12-31-39-14-64-79 kernel: [ 1118.812464] br0: port 
1(vethC8ic4K) entered forwarding state
+ 
+ --KERNEL ERROR
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252907] ------------[ 
cut here ]------------
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252921] kernel BUG at 
/build/buildd/linux-3.5.0/arch/x86/mm/fault.c:396!
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252926] invalid opcode: 
0000 [#1] SMP
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252932] CPU 0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252934] Modules linked 
in: veth dm_snapshot xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4 
nf_conntrack nf_defrag_ipv4 ip_tables x_tables bridge stp llc isofs microcode 
acpiphp
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252958]
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252960] Pid: 8140, 
comm: cat Not tainted 3.5.0-17-generic #28-Ubuntu
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252966] RIP: 
e030:[<ffffffff8168533f>]  [<ffffffff8168533f>] vmalloc_fault+0x11f/0x208
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252979] RSP: 
e02b:ffff880002f1d9b8  EFLAGS: 00010046
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252983] RAX: 
ffff880026caeff8 RBX: ffffe8ffffc00ac8 RCX: 0000000000000000
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252988] RDX: 
00003ffffffff000 RSI: ffff880000000ff8 RDI: 0000000000000000
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252993] RBP: 
ffff880002f1d9d8 R08: ffff880017c6ae70 R09: 00007f7b4d46e000
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.252998] R10: 
0000000000000001 R11: 0000000000000000 R12: ffff880066231e88
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253003] R13: 
ffff880026caeff8 R14: ffff880000000ff8 R15: 0000000000000002
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253012] FS:  
00007f7b4d68c700(0000) GS:ffff88006a000000(0000) knlGS:0000000000000000
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253017] CS:  e033 DS: 
0000 ES: 0000 CR0: 000000008005003b
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253021] CR2: 
ffffe8ffffc00ac8 CR3: 0000000066231000 CR4: 0000000000002660
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253027] DR0: 
0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253033] DR3: 
0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253038] Process cat 
(pid: 8140, threadinfo ffff880002f1c000, task ffff88002470dc00)
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253044] Stack:
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253046]  
ffffe8ffffc00ac8 0000000000000029 ffff880002f1daf8 0000000000000000
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253055]  
ffff880002f1dae8 ffffffff816858f9 0000000000000657 ffffffff812e79e1
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253064]  
ffff88002470dc00 0000000000000060 ffff880055ecdd1c ffff88005636b540
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253072] Call Trace:
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253078]  
[<ffffffff816858f9>] do_page_fault+0x3b9/0x4e0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253087]  
[<ffffffff812e79e1>] ? aa_path_name+0x71/0x440
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253094]  
[<ffffffff8107e86a>] ? lg_local_unlock+0x1a/0x20
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253100]  
[<ffffffff8168b14b>] ? xen_hypervisor_callback+0x1b/0x20
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253106]  
[<ffffffff81004eec>] ? xen_mc_extend_args+0xec/0x110
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253112]  
[<ffffffff810046c0>] ? load_TLS_descriptor+0x40/0xc0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253118]  
[<ffffffff81004bd2>] ? xen_mc_flush+0xb2/0x1b0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253123]  
[<ffffffff816821e5>] page_fault+0x25/0x30
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253130]  
[<ffffffff81176e54>] ? mem_cgroup_charge_statistics.isra.15+0x14/0x50
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253137]  
[<ffffffff81178ebc>] __mem_cgroup_uncharge_common+0xcc/0x2c0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253143]  
[<ffffffff8100761d>] ? xen_pte_val+0x1d/0x40
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253149]  
[<ffffffff8117c242>] mem_cgroup_uncharge_page+0x22/0x30
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253155]  
[<ffffffff81153c97>] page_remove_rmap+0xb7/0x140
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253162]  
[<ffffffff8114797a>] ? vm_normal_page+0x1a/0x80
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253168]  
[<ffffffff81148c31>] unmap_page_range+0x4b1/0x740
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253173]  
[<ffffffff81148f4b>] unmap_single_vma+0x8b/0xd0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253179]  
[<ffffffff81149762>] unmap_vmas+0x52/0xa0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253184]  
[<ffffffff81150cf2>] exit_mmap+0x92/0x150
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253191]  
[<ffffffff81681dbe>] ? _raw_spin_lock_irqsave+0x2e/0x40
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253198]  
[<ffffffff8104ef24>] mmput+0x74/0x110
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253204]  
[<ffffffff810577ba>] exit_mm+0x10a/0x130
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253208]  
[<ffffffff81057939>] do_exit+0x159/0x8e0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253213]  
[<ffffffff8105841f>] do_group_exit+0x3f/0xa0
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253218]  
[<ffffffff81058497>] sys_exit_group+0x17/0x20
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253224]  
[<ffffffff81689d29>] system_call_fastpath+0x16/0x1b
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253229] Code: 4c 89 e7 
e8 71 e4 fe ff 4c 89 ef 48 89 de 49 89 c6 e8 63 e4 fe ff 48 83 38 00 49 89 c5 
0f 84 e5 00 00 00 49 8b 3e 48 85 ff 75 02 <0f> 0b ff 14 25 e0 dd c1 81 48 89 c2 
4$
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253285] RIP  
[<ffffffff8168533f>] vmalloc_fault+0x11f/0x208
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253291]  RSP 
<ffff880002f1d9b8>
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253307] ---[ end trace 
0e83c1ffecd3a6f4 ]---
+ Oct 30 14:43:39 domU-12-31-39-14-64-79 kernel: [ 1121.253312] Fixing 
recursive fault but reboot is needed!

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1071910

Title:
  lxc stop will hang forever

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/lxc/+bug/1071910/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to