I've unsuccessfully attempted to go down this road - there are API changes,
some of which I was able to solve by taking code snippets from Solr 3.1.
Some extraction-related tests wouldn't pass (look for 'Solr 1.4.1 and
Tika 0.9 - some tests not passing' in the archive).  Ultimately, I decided
that the then newly released Solr 3.1 was the less rocky route.  Not sure if
that is an option for you.

Andreas



________________________________
From: "Mattmann, Chris A (388J)" <chris.a.mattm...@jpl.nasa.gov>
To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org>
Sent: Mon, June 20, 2011 7:18:34 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1

Hi Surendra,

On Jun 20, 2011, at 4:59 AM, Surendra wrote:

> Hey Chris
> 
> I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib)
> after building them using the source provided by TIKA. Now I have an issue
> with this. I am working with extracting PDF content using Solr. I have added
> fmap.content to the configurable params as "attr_content" where I can see the
> entire extracted document. After the TIKA update i am not able to see
> attr_content appearing in the search results. When I restore it with old 0.4
> TIKA jars again the attr_content appears. I didn't find any exceptions shown
> up there in the console. Is this a known behavior that someone have faced
> already? Can you guide me to resolve this?

I don't think you can simply add a new tika-core-0.9 and tika-parsers-0.9 to 
extraction/lib -- I think you'll need to replace the set of prior Tika jars in 
there. Have a look here to see what jars you would need to replace, HTH:

http://tika.apache.org/0.9/gettingstarted.html
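
One way to check whether old and new jars ended up side by side is to scan
the lib directory for artifacts that appear in more than one version. The
following is only a rough sketch: the function name find_version_conflicts
is made up for illustration, and it assumes jar files follow the usual
<artifact>-<version>.jar naming (anything named differently is skipped).

```python
import re
from collections import defaultdict
from pathlib import Path

# Assumed naming scheme: "tika-core-0.9.jar" -> artifact "tika-core", version "0.9".
JAR_NAME = re.compile(r"^(?P<artifact>.+?)-(?P<version>\d[\w.-]*)\.jar$")


def find_version_conflicts(lib_dir):
    """Return {artifact: sorted version strings} for artifacts found in 2+ versions."""
    versions = defaultdict(set)
    for jar in Path(lib_dir).glob("*.jar"):
        match = JAR_NAME.match(jar.name)
        if match:  # jars that do not fit the assumed naming scheme are ignored
            versions[match["artifact"]].add(match["version"])
    return {name: sorted(found) for name, found in versions.items() if len(found) > 1}
```

Calling find_version_conflicts("extraction/lib") (adjust the path to your
install) returns an empty dict when no artifact is present twice. Any entry
it does return, for example tika-core in both 0.4 and 0.9, points at an old
jar that was left behind rather than replaced. Note that it will not catch
a dependency that was renamed between releases.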

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
