Re: TIKA Errors Importing MS Word Documents into SOLR Cloud

2012-02-27 Thread Erick Erickson
You *probaby* can update the Tika libraries in Solr, but it'll be "interesting" to get all the right ones updated, there are a bunch of them in Tika. And I make no guarantees. If it proves difficult, it's not too hard to write a SolrJ program that does the Tika extraction and run it on a client to

TIKA Errors Importing MS Word Documents into SOLR Cloud

2012-02-26 Thread Matthew Parker
I tried to import some documents into SOLR Cloud using Apache Manifold. TIKA started throwing exceptions for various documents The exception reads like the following: org.apache.solr.common.SolrException at org.apache.solr.handler.extraction.ExtractionDocumentLoader.load( ExtractingDocumentLoade