On 2025-02-24 17:51:17, Michael Tremer wrote: > Hello Antoine, > >> On 24 Feb 2025, at 16:51, Antoine Beaupré <anar...@debian.org> wrote:
[...] >> So, TL;DR: improved, but not fixed. I suspect we had a multi-dimensional >> issue, of which search/whoosh *was* a part of, because we would see a >> huge increase in OOMs when rebuilding the indexes. But we're still >> having an issue, so perhaps there's something else. > > This is my experience with Xapian and I have found confirmation that this is > supposed to be normal. My mailbox indexes were massive and there was no point > having them any more. So I can confirm that this looked very similar in > Dovecot, too. A 10x amplification in the disk usage is not normal. https://gitlab.com/mailman/hyperkitty/-/issues/533 I use notmuch as a search index here, and the amplification is *opposite*, 4.5x *reduction* in disk usage compared to the original dataset.