Peter Maydell <[email protected]> wrote: > On Fri, 3 Feb 2023 at 21:14, Juan Quintela <[email protected]> wrote: >> >> Peter Maydell <[email protected]> wrote: >> > The migration tests have been flaky for a while now, >> > including setups where host and guest page sizes are the same. >> > (For instance, my x86 macos box pretty reliably sees failures >> > when the machine is under load.) >> >> I *thought* that we had fixed all of those. >> >> But it is difficult for me to know because: >> - I only happens when one runs "make check" >> - running ./migration-test have never failed to me >> - When it fails (and it has been a while since it has failed to me) >> it is impossible to me to detect what is going on, and as said, I have >> never been able to reproduce running only migration-test. > > Yes. If we could improve the logging in the test so that when > an intermittent failure does happen the test prints better > clues about what happened, I think that would help a lot. > > https://lore.kernel.org/qemu-devel/cafeaca8x_im3hn2-p9f+huxnxfxy+d6fze+leq4erldg7zk...@mail.gmail.com/ > is the thread from late December about the macos failures.
We (red hat) found a similar problem with aarch64, but only when using zero copy. Will try to see if I can reproduce this other there. https://bugzilla.redhat.com/show_bug.cgi?id=2160929 the similar thing to what you have is: - they are trying to cancel - they are on aarch64 but: - they can only reproduce with zero copy enabled. Later, Juan.
