Bug#659762: lvm2: LVM commands freeze after snapshot delete fails

Frank Steinborn Mon, 24 Jun 2013 08:21:57 -0700

This bug is a really serious issue for us. Let me explain.

We run active/passive MySQL clusters using Pacemaker and DRBD. For backups,
we create an LVM snapshot on the passive node, fire up a MySQL server and
dump data from that snapshot. When the dump is finished, we stop MySQL on
the passive node and remove the snapshot. This fails almost every time due
to this bug.


The big problem, though, is that when the bug appears and the LVM subsystem
freezes, our active node hangs completely in iowait because DRBD is unable
to finish IO operations on the passive node. So basically the whole cluster
fails and we get hard downtime until we take action manually and do
'lvresume' and/or reboot.

We had a look at some of the proposed workarounds, but they are not an
option for us.

What I basically want to ask is: is there any ETA for when this will be
resolved? We are seriously considering a downgrade to squeeze. That would
be a _huge_ pain, though.

Thanks,
Frank
