https://bugs.kde.org/show_bug.cgi?id=438455
--- Comment #34 from tagwer...@innerjoin.org ---
(In reply to skierpage from comment #0)
> ADDITIONAL INFORMATION
> There are tools to extract text from MSOffice files...

That is a good lead, thanks!

It looks like you can convert a doc to text with:

    $ libreoffice --headless --convert-to "txt:Text (encoded):UTF8" document.doc

or stream the text to stdout, minimally, with:

    $ libreoffice --cat document.doc

but this can give some "extraneous" warning messages. I'm trying out:

    $ libreoffice --headless --safe-mode --cat document.doc

and:

    $ libreoffice --headless "-env:UserInstallation=file:///tmp/Baloo_Conversion_${USER}" --cat document.doc

It seems that this conversion ought to work more generally, but I get failures with .xls or .ppt files. Maybe watch:

    https://bugs.documentfoundation.org/show_bug.cgi?id=150846
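
For anyone who wants to try this out, the last variant could be wrapped up roughly as below. This is only a sketch, not what Baloo does: the function name extract_text, the SOFFICE override and the "unknown" fallback for an unset USER are all made up for illustration, and I am assuming the warning messages arrive on stderr, which I have not checked.

```shell
#!/bin/sh
# Hypothetical wrapper around the "--cat" call with a private profile directory.
# SOFFICE is an illustrative override (not a LibreOffice variable) so that a
# different binary, or a stub, can be substituted; it defaults to "libreoffice".
extract_text() {
    doc=$1
    # Same throwaway profile location as in the command above; the "unknown"
    # fallback for an unset USER is an assumption added here.
    profile="file:///tmp/Baloo_Conversion_${USER:-unknown}"
    # Assumption: warnings are written to stderr, so dropping stderr leaves
    # only the document text on stdout.
    "${SOFFICE:-libreoffice}" --headless \
        "-env:UserInstallation=${profile}" \
        --cat "$doc" 2>/dev/null
}
```

Something like "extract_text document.doc > document.txt" would then collect the text. The function passes on whatever exit status the command returns, so a caller could treat a non-zero status or empty output as "nothing extracted". Whether the .xls or .ppt failures actually show up that way is something I have not verified.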