https://bugs.kde.org/show_bug.cgi?id=438455

--- Comment #34 from tagwer...@innerjoin.org ---
(In reply to skierpage from comment #0)
> ADDITIONAL INFORMATION
> There are tools to extract text from MSOffice files...
That is a good lead, thanks!

Looks like you can convert a doc to text with:

    $ libreoffice --headless --convert-to "txt:Text (encoded):UTF8"
document.doc

or stream the text to stdout, minimally with:

    $ libreoffice --cat document.doc

but this can give some "extraneous" warning messages. I'm trying out:

    $ libreoffice --headless --safe-mode --cat document.doc

and:

    $ libreoffice --headless
"-env:UserInstallation=file:///tmp/Baloo_Conversion_${USER}" --cat document.doc

It seems that this conversion ought work more generally but I get failures with
.xls or .ppt files, maybe watch:

    https://bugs.documentfoundation.org/show_bug.cgi?id=150846

-- 
You are receiving this mail because:
You are watching all bug changes.

Reply via email to