https://bugs.kde.org/show_bug.cgi?id=438074

--- Comment #20 from tagwer...@innerjoin.org ---
(In reply to Frank Steinmetzger from comment #19)
> It’s all maildir, but with over 100k files. ^^
A hurried search for "maildir format" tells me that it holds one message per
file, in a format like .eml. At least kmimetypefinder reports
"message/rfc822". I think Bug 460882 would still apply: you could be writing
loads of "random" strings (from encoded attachments, for example) and
repeatedly rewriting the entries for "common terms".

If each of your messages has a "Subject" line, a search for "Subject" will
retrieve them all. The database record for "Subject" will have been rewritten,
with a commit, after each batch of files indexed. That will be a lot of
rewriting. Baloo is aware of this issue and indexes files in batches of 40
to cut down on the amount of rewriting required. I suppose, for loads of
small files, it could use larger batches...
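
To put rough numbers on that, here is a back-of-the-envelope sketch. It is not
Baloo's code; it is a toy model that assumes a term such as "Subject" appears
in every file and that its whole record is written out again at each commit.
The function name and the batch size of 400 are made up for illustration.

```python
def rewrite_cost(num_files: int, batch_size: int) -> tuple[int, int]:
    """Toy model: cost of indexing a term that occurs in every file.

    Assumes (for illustration only) that the term's full record is
    rewritten at each commit, one commit per batch.
    Returns (number of commits, total record entries written).
    """
    commits = 0
    entries_written = 0
    indexed = 0
    while indexed < num_files:
        # Index the next batch (the last batch may be smaller).
        indexed += min(batch_size, num_files - indexed)
        commits += 1
        # The record now lists every file indexed so far and is rewritten whole.
        entries_written += indexed
    return commits, entries_written


if __name__ == "__main__":
    for batch in (40, 400):
        commits, entries = rewrite_cost(100_000, batch)
        print(f"batch={batch}: {commits} commits, {entries} entries written")
```

Under those assumptions, 100k files in batches of 40 means 2500 commits, and a
ten times larger batch cuts both the commits and the rewriting by about a
factor of ten.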
