Celejar writes: > Recoll currently uses antiword and catdoc for MS Word documents. Antiword > claims to only support Word 2003 (according to the website) or lower, and > catdoc only to Word 97 (according to the Debian package info). Unoconv > claims > to be able to support any documents that OO.org supports, so it should cover > modern Word formats not covered by the current utilities.
As far as I know, "modern word formats", which I take to be "Open Xml", are covered by a native filter (rclopxml), based on xsltproc. If there are specific issues and problem documents, it would be nice to please provide a sample. Testing on Open Xml documents has indeed been very minimal, so I would not be very surprised if there are issues. Cheers, jf -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org