Bug#542250: repeatable crashes while copying 500G from NFS

2009-10-09 Thread Ben Hutchings
On Fri, 2009-10-09 at 11:56 +0200, Markus Hochholdinger wrote: > Hello, > > in my case, the workaround to limit "dom0-cpus=1" makes my dom0s stable. But > i > can't migrate (--live) any domUs any more. Also shutdown/halt of a domU hangs > after shutdown/halt. This doesn't happen with dom0-cpus=

Bug#542250: repeatable crashes while copying 500G from NFS

2009-10-09 Thread Markus Hochholdinger
Hello, in my case, the workaround to limit "dom0-cpus=1" makes my dom0s stable. But i can't migrate (--live) any domUs any more. Also shutdown/halt of a domU hangs after shutdown/halt. This doesn't happen with dom0-cpus=0. Can anyone tell where to find 2.6.26-20? Till now i didn't find this on

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-09-06 Thread Ben Hutchings
On Sun, 2009-09-06 at 10:57 +0400, Nikita V. Youshchenko wrote: > > > Is there any chance to get this fix into lenny? AFAIU, this bug may > > > affect both i386 and amd64, both dom0 and domU [if domU is running > > > -xen flavour], almost on any hardware with >1 CPUs, and cause system > > > crash.

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-09-06 Thread Nikita V. Youshchenko
> > Is there any chance to get this fix into lenny? AFAIU, this bug may > > affect both i386 and amd64, both dom0 and domU [if domU is running > > -xen flavour], almost on any hardware with >1 CPUs, and cause system > > crash... > > Yes, I have added it to the lenny branch. Hi I don't see any me

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-21 Thread Ben Hutchings
On Fri, 2009-08-21 at 11:06 +0400, Nikita V. Youshchenko wrote: > tags 542250 +patch > thanks > > > > That means, do_IRQ can't be called on the same CPU before previous > > > invocation returns. So multiple levels of "hard-interrupt contexts" > > > just can't exist. > > > > > > Then even 2 spinnin

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-21 Thread Nikita V. Youshchenko
> Re-tagging it as 'patch' then. > > Is there any chance to get this fix into lenny? AFAIU, this bug may > affect both i386 and amd64, both dom0 and domU [if domU is running -xen > flavour], almost on any hardware with >1 CPUs, and cause system crash... For now, I've placed amd64 and i386 debs wi

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-21 Thread Nikita V. Youshchenko
tags 542250 +patch thanks > > That means, do_IRQ can't be called on the same CPU before previous > > invocation returns. So multiple levels of "hard-interrupt contexts" > > just can't exist. > > > > Then even 2 spinning-state structures per cpu should be enough. No > > need to have more. > > > > A

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-20 Thread Ben Hutchings
On Thu, 2009-08-20 at 18:40 +0400, Nikita V. Youshchenko wrote: > > There are three spinning-state structures (per CPU) here, allowing for > > spinning in process context, in bottom-half (softirq) context, and in > > hard-interrupt context. > > There is an interrupt flag (called IRQF_DISABLED in 2

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-20 Thread Nikita V. Youshchenko
> There are three spinning-state structures (per CPU) here, allowing for > spinning in process context, in bottom-half (softirq) context, and in > hard-interrupt context. There is an interrupt flag (called IRQF_DISABLED in 2.6.26, it was SA_NODELAY at some point in the past AFAIR) that controls i

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-20 Thread Ben Hutchings
On Thu, 2009-08-20 at 09:21 +0400, Nikita V. Youshchenko wrote: > > On Wed, 2009-08-19 at 22:36 +0400, Nikita V. Youshchenko wrote: [...] > > I'm glad it works for you, but it isn't a proper fix. > > Could you please explain? How that code line cod hit if not in interrupt > handler? > > Here is

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-19 Thread Nikita V. Youshchenko
> On Wed, 2009-08-19 at 22:36 +0400, Nikita V. Youshchenko wrote: > > tags 542250 +patch > > thanks > > > > > ... I may guess that line 74 should check for in_interrupt() instead > > > of in_softirq(). > > > > I've tried that and it really fixed the problem. Server already runs > > the same backup

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-19 Thread Ben Hutchings
On Wed, 2009-08-19 at 22:36 +0400, Nikita V. Youshchenko wrote: > tags 542250 +patch > thanks > > > ... I may guess that line 74 should check for in_interrupt() instead of > > in_softirq(). > > I've tried that and it really fixed the problem. Server already runs the > same backup procedure for s

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-19 Thread Nikita V. Youshchenko
tags 542250 +patch thanks > ... I may guess that line 74 should check for in_interrupt() instead of > in_softirq(). I've tried that and it really fixed the problem. Server already runs the same backup procedure for several hours. Previously it crashed within 15 minutes. Here is the patch I've

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-18 Thread Ben Hutchings
On Tue, 2009-08-18 at 20:38 +0400, Nikita V. Youshchenko wrote: > Package: linux-image-2.6.26-2-xen-amd64 > Version: 2.6.26-17lenny2 > Severity: normal > > I'm running several servers (Dual Core AMD Opteron, 16G of ram) with > Xen. > > Dom0 is debian lenny adm64. > > Today I tried to run a large

Bug#542250: repeatable crashes while copying 500G from NFS mount to local logical volume

2009-08-18 Thread Nikita V. Youshchenko
Package: linux-image-2.6.26-2-xen-amd64 Version: 2.6.26-17lenny2 Severity: normal I'm running several servers (Dual Core AMD Opteron, 16G of ram) with Xen. Dom0 is debian lenny adm64. Today I tried to run a large backup in dom0 - copy NFS-mounted partition to a local volume group. buki:~# cd ba