https://bugs.kde.org/show_bug.cgi?id=438455
--- Comment #30 from skierpage <skierp...@gmail.com> ---
This bug report has gotten very hard to follow. But:

1. If I follow my own steps (with LibreOffice Writer 7.4.2.3), baloo doesn't index.
2. If I download Guido's attachment 153915, baloo_test_Libreoffice_7.4.2.3.doc, baloo doesn't index.
3. If I download Guido's attachment 153916, baloo_test_WPS_Office.doc, baloo does index.
4. I have old MS Office docs that baloo does index.

In all cases, the output of `catdoc FILENAME` matches baloo's indexing behavior: the files baloo doesn't index are the ones for which catdoc's output is empty and its exit code is 69.

@tagwerk19, what are your results with attachment 153915?

I wrote
> I couldn't find any Linux utility that identifies the version of the Word
> file format that a .doc file uses

`file FILENAME` gives a lot of info; the non-indexed LibreOffice documents have Code page -535. I don't know if this is significant.

I stepped through catdoc with gdb, and for my file it didn't find an oleEntry matching WordDocument and exited with error code 69.

It is unhelpful that kfilemetadata's officeextractor.cpp doesn't log when `catdoc` fails and nothing gets indexed!

kmimetypefinder identifies all of these .doc files as application/msword.
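
For anyone who wants to repeat the `catdoc FILENAME` comparison over several files, here is a minimal Python sketch. It only runs the extractor and reports the exit code and whether any text came out; it does not reproduce what kfilemetadata actually does. The function names `probe_extractor` and `describe` are made up for illustration, and the `command` parameter is there so a different extractor can be substituted. It assumes `catdoc` is on the PATH; if it is not, `subprocess.run` raises `FileNotFoundError`.

```python
import subprocess


def probe_extractor(path, command=("catdoc",)):
    """Run a text extractor on `path` and return (exit_code, has_text).

    `command` defaults to plain `catdoc` with no extra options; this is an
    assumption for illustration, not necessarily how baloo invokes it.
    """
    result = subprocess.run(
        [*command, path],
        stdout=subprocess.PIPE,
        stderr=subprocess.DEVNULL,
        check=False,  # a non-zero exit code is the thing we want to observe
    )
    # Treat whitespace-only output the same as empty output.
    has_text = bool(result.stdout.strip())
    return result.returncode, has_text


def describe(path, command=("catdoc",)):
    """Return a one-line summary of the extractor result for `path`."""
    code, has_text = probe_extractor(path, command)
    if code == 0 and has_text:
        return f"{path}: text extracted (exit 0)"
    return f"{path}: no usable text (exit {code})"
```

Calling `describe()` on each downloaded attachment should, if the observations above hold, report "no usable text (exit 69)" for the LibreOffice-written file and "text extracted" for the WPS Office one.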