Package: qemu-system-x86
Version: 1:2.1+dfsg-12~bpo70+1

I have repeatedly seen VMs become unresponsive after live migration: CPU load goes to the maximum and the guest no longer reacts on the network or the console.

There is a known bug that looks similar:
[PATCH] kvmclock: Ensure time in migration never goes backward
http://comments.gmane.org/gmane.comp.emulators.kvm.devel/121498

The corresponding patch was released with qemu 2.1.3:
http://lists.gnu.org/archive/html/qemu-stable/2015-01/msg00143.html
c35ba0d: kvmclock: Ensure time in migration never goes backward (Alexander Graf)

It is this patch:
* kvmclock: Ensure time in migration never goes backward
  http://git.qemu-project.org/?p=qemu.git;a=commit;h=9a48bcd1b82494671c111109b0eefdb882581499

I tried it, and at first it looked very good. But apparently there is a reason the patch took so long to be released: some migrated VMs died some hours later with:
qemu-system-x86_64: qemu-2.1+dfsg/hw/i386/kvm/clock.c:68: kvmclock_current_nsec: Assertion `time.tsc_timestamp <= migration_tsc' failed.

Looking further, some of the other patches released with 2.1.3 are probably related. I ended up applying these as well:
* Introduce cpu_clean_all_dirty
  http://git.qemu-project.org/?p=qemu.git;a=commit;h=de9d61e83d43be9069e6646fa9d57a3f47779d28
* kvmclock: Ensure proper env->tsc value for kvmclock_current_nsec
  http://git.qemu-project.org/?p=qemu.git;a=commit;h=317b0a6d8ba44e9bf8f9c3dbd776c4536843d82c
* kvmclock: Add comment explaining why we need cpu_clean_all_dirty()
  http://git.qemu-project.org/?p=qemu.git;a=commit;h=1154d84dcc5f46e83db94281d071775819dd8884

I think this bug report describes the same problem:
https://bugzilla.redhat.com/show_bug.cgi?id=1109427

I ran the migration tests again with all of these patches applied and had no more problems.

Cheers
 Harry

