Re: Solr throws TikaException while parsing sample PDF

2010-04-21 Thread Praveen Agrawal
Can somebody please guide me here? On Tue, Apr 20, 2010 at 10:53 AM, Praveen Agrawal wrote: > I'm using Solr 1.4 distribution, with Solr cell. Can i update only new > version of Tika in Solr 1.4 distn? If yes, any guide etc? > Thanks. > > > > On Mon, Apr 19, 2010 at 4:36 PM, Koji Sekiguchi wrot

Re: Solr throws TikaException while parsing sample PDF

2010-04-19 Thread Praveen Agrawal
I'm using Solr 1.4 distribution, with Solr cell. Can i update only new version of Tika in Solr 1.4 distn? If yes, any guide etc? Thanks. On Mon, Apr 19, 2010 at 4:36 PM, Koji Sekiguchi wrote: > Praveen Agrawal wrote: > >> Hi Grant, >> I tried command line of Tika v-0.7(newest), and it parsed th

Re: Solr throws TikaException while parsing sample PDF

2010-04-19 Thread Koji Sekiguchi
Praveen Agrawal wrote: Hi Grant, I tried command line of Tika v-0.7(newest), and it parsed the file.. I believe Solr1.4 contains 0.4 version of Tika. Do you suggest to upgrade to new Tika? Can i upgrade only tika in Solr-1.4? or i need to wait till Solr ships with new Tika? Thanks. Solr trunk

Re: Solr throws TikaException while parsing sample PDF

2010-04-19 Thread Praveen Agrawal
Hi Grant, I tried command line of Tika v-0.7(newest), and it parsed the file.. I believe Solr1.4 contains 0.4 version of Tika. Do you suggest to upgrade to new Tika? Can i upgrade only tika in Solr-1.4? or i need to wait till Solr ships with new Tika? Thanks. On Sun, Apr 18, 2010 at 11:24 PM, Gra

Re: Solr throws TikaException while parsing sample PDF

2010-04-18 Thread Grant Ingersoll
Can you extract content from this using Tika's standalone command line tool? PDF's are notorious for problems in extracting. To me, it looks like a bug in PDFBox. I would try to isolate it down to there and then send, if possible, the sample document to PDFBox and see if they can come up w/ a