I've unsuccessfully attempted to go down this road - there are API changes, some of which I was able to solve by taking code snippets from Solr 3.1. Some extraction-related tests for wouldn't pass (look for 'Solr 1.4.1 and Tika 0.9 - some tests not passing' in the archive). Ultimately, I decided that the then newly released Solr 3.1 was the less rocky route. Not sure if that is an option for you.
Andreas ________________________________ From: "Mattmann, Chris A (388J)" <chris.a.mattm...@jpl.nasa.gov> To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org> Sent: Mon, June 20, 2011 7:18:34 AM Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1 Hi Surendra, On Jun 20, 2011, at 4:59 AM, Surendra wrote: > Hey Chris > > I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib) > after building them using the source provided by TIKA. Now I have an issue with > this. I am working with extracting PDF content using Solr. I have added > fmap.content to the configurable params as "attr_content" where I can see the > entire extracted document. After the TIKA update i am not able to see > attr_content appearing in the search results. When I restore it with old 0.4 > TIKA jars again the attr_content appears. I didn't find any exceptions shown up > there in the console. Is this a known behavior that someone have faced already? > Can you guide me to resolve this? I don't think you can simple add a new tika-core-0.9 and tika-parsers-0.9 to extraction/lib -- I think you'll need to replace the set of prior Tika jars in there. Have a look here to see what jars you would need to replace, HTH: http://tika.apache.org/0.9/gettingstarted.html Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++