Re: Issue with Solr 3.5 while using TikaEntityProcessor on .docx files

2012-04-16 Thread Roman K
On 04/16/2012 06:45 PM, Roman K wrote: On 04/16/2012 04:31 PM, Jan Høydahl wrote: Hi, Solr3.6 is just out with Tika 1.0. Can you try that? Also, Solr TRUNK now has Tika 1.1... I recommend downloading Tika-App and testing your offending files directly with that http://tika.apache.org/1.1/getti

Re: Issue with Solr 3.5 while using TikaEntityProcessor on .docx files

2012-04-16 Thread Roman K
On 04/16/2012 04:31 PM, Jan Høydahl wrote: Hi, Solr3.6 is just out with Tika 1.0. Can you try that? Also, Solr TRUNK now has Tika 1.1... I recommend downloading Tika-App and testing your offending files directly with that http://tika.apache.org/1.1/gettingstarted.html -- Jan Høydahl, search s

Re: Issue with Solr 3.5 while using TikaEntityProcessor on .docx files

2012-04-16 Thread Jan Høydahl
Hi, Solr3.6 is just out with Tika 1.0. Can you try that? Also, Solr TRUNK now has Tika 1.1... I recommend downloading Tika-App and testing your offending files directly with that http://tika.apache.org/1.1/gettingstarted.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominven

Issue with Solr 3.5 while using TikaEntityProcessor on .docx files

2012-04-16 Thread Roman K
Hello, I am running some tests to see, whether we can use Solr in our organization. I have to be able to process MS Word .docx files and then be able to search them as they were simple plain text. The problem is that when processing the docx files, the result that I get while running the *:* q