Sandhya,
How did you proceed?I did this:- jar -xf solr.war.- i then added all of the 
libs i had into the web-inf/lib folder- i then recreated the jar with jar -cvf 
solr.war *- replaced the war files- deleted the libs in the shared lib folder- 
started tomcat
i'm now getting an error saying this:
SEVERE: org.apache.solr.common.SolrException: Error loading class 
'org.apache.solr.handler.extraction.ExtractingRequestHandler'        at 
org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:375)  
      at org.apache.solr.core.SolrCore.createInstance(SolrCore.java:418)        
at org.apache.solr.core.SolrCore.createRequestHandler(SolrCore.java:454)        
at 
org.apache.solr.core.RequestHandlers.initHandlersFromConfig(RequestHandlers.java:152)
Thanks Grant for investigating the problem!
Marc

> From: sagar...@opentext.com
> To: solr-user@lucene.apache.org
> Date: Tue, 4 May 2010 13:10:25 +0530
> Subject: RE: Problem with pdf, upgrading Cell
> 
> Yes, Grant. You are right. Copying the tika libraries to solr webapp, solved 
> the issue and the content extraction works fine now.
> 
> Thanks,
> Sandhya
> 
> -----Original Message-----
> From: Sandhya Agarwal [mailto:sagar...@opentext.com] 
> Sent: Tuesday, May 04, 2010 12:58 PM
> To: solr-user@lucene.apache.org
> Subject: RE: Problem with pdf, upgrading Cell
> 
> Hello,
> 
> 
> 
> But I see that the libraries are being loaded :
> 
> 
> 
> INFO: Adding specified lib dirs to ClassLoader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/asm-3.1.jar' 
> to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/bcmail-jdk15-1.45.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/bcprov-jdk15-1.45.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/commons-compress-1.0.jar' 
> to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/commons-logging-1.1.1.jar' 
> to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/dom4j-1.6.1.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/fontbox-1.1.0.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/geronimo-stax-api_1.0_spec-1.0.1.jar'
>  to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/jempbox-1.1.0.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/log4j-1.2.14.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/metadata-extractor-2.4.0-beta-1.jar'
>  to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/pdfbox-1.1.0.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/poi-3.6.jar' 
> to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/poi-ooxml-3.6.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/poi-ooxml-schemas-3.6.jar' 
> to classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/poi-scratchpad-3.6.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/tagsoup-1.2.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/tika-core-0.7.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/tika-parsers-0.7.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/xercesImpl-2.8.1.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/xml-apis-1.0.b2.jar' to 
> classloader
> 
> May 4, 2010 12:49:59 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/extraction/lib/xmlbeans-2.3.0.jar' to 
> classloader
> 
> May 4, 2010 12:50:16 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 'file:/C:/apache-solr-1.4.0/dist/apache-solr-cell-1.4.0.jar' to 
> classloader
> 
> May 4, 2010 12:50:20 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/dist/apache-solr-clustering-1.4.0.jar' to 
> classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/carrot2-mini-3.1.0.jar' to 
> classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/commons-lang-2.4.jar' to 
> classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/ehcache-1.6.2.jar' to 
> classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/google-collections-1.0-rc2.jar'
>  to classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/jackson-core-asl-0.9.9-6.jar'
>  to classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/jackson-mapper-asl-0.9.9-6.jar'
>  to classloader
> 
> May 4, 2010 12:51:52 PM org.apache.solr.core.SolrResourceLoader 
> replaceClassLoader
> 
> INFO: Adding 
> 'file:/C:/apache-solr-1.4.0/contrib/clustering/lib/log4j-1.2.14.jar' to 
> classloader
> 
> 
> 
> Thanks,
> 
> Sandhya
> 
> 
> 
> -----Original Message-----
> From: Grant Ingersoll [mailto:gsi...@gmail.com] On Behalf Of Grant Ingersoll
> Sent: Tuesday, May 04, 2010 6:13 AM
> Cc: solr-user@lucene.apache.org
> Subject: Re: Problem with pdf, upgrading Cell
> 
> 
> 
> Little more info... Seems to be a classloading issue.  The tests pass, but 
> they aren't loading the Tika libraries via the Solr ResourceLoader, whereas 
> the example is.  Marc, one thing to try is to unjar the Solr WAR file and put 
> the Tika libs in there, as I bet it will then work.  Note, however, I haven't 
> tried this.
> 
> 
> 
> On May 3, 2010, at 6:24 PM, Grant Ingersoll wrote:
> 
> 
> 
> > I've opened https://issues.apache.org/jira/browse/SOLR-1902 to track this.  
> > It is indeed a bug somewhere (still investigating).  It seems that Tika is 
> > now picking an EmptyParser implementation when trying to determine which 
> > parser to use, despite the fact that it properly identifies the MIME Type.
> 
> >
> 
> > -Grant
> 
> >
> 
> > On May 3, 2010, at 5:36 PM, Grant Ingersoll wrote:
> 
> >
> 
> >> I'm investigating.
> 
> >>
> 
> >> On May 3, 2010, at 5:17 AM, Marc Ghorayeb wrote:
> 
> >>
> 
> >>>
> 
> >>> Hi,
> 
> >>> Grant, i confirm what Praveen has said, any PDF i try does not work with 
> >>> the new Tika and SVN versions. :(
> 
> >>> Marc
> 
> >>>
> 
> >>>> From: sagar...@opentext.com
> 
> >>>> To: solr-user@lucene.apache.org
> 
> >>>> Date: Mon, 3 May 2010 13:05:24 +0530
> 
> >>>> Subject: RE: Problem with pdf, upgrading Cell
> 
> >>>>
> 
> >>>> Hello,
> 
> >>>>
> 
> >>>> Please let me know if anybody figured out a way out of this issue.
> 
> >>>>
> 
> >>>> Thanks,
> 
> >>>> Sandhya
> 
> >>>>
> 
> >>>> -----Original Message-----
> 
> >>>> From: Praveen Agrawal [mailto:pkal...@gmail.com]
> 
> >>>> Sent: Friday, April 30, 2010 11:14 PM
> 
> >>>> To: solr-user@lucene.apache.org
> 
> >>>> Subject: Re: Problem with pdf, upgrading Cell
> 
> >>>>
> 
> >>>> Grant,
> 
> >>>> You can try any of the sample pdfs that come in /docs folder of Solr 1.4
> 
> >>>> dist'n. I had tried 'Installing Solr in Tomcat.pdf', 'index.pdf' etc. 
> >>>> Only
> 
> >>>> metadata i.e. stream_size, content_type apart from my own literals are
> 
> >>>> indexed, and content is missing..
> 
> >>>>
> 
> >>>>
> 
> >>>> On Fri, Apr 30, 2010 at 8:52 PM, Grant Ingersoll 
> >>>> <gsing...@apache.org>wrote:
> 
> >>>>
> 
> >>>>> Praveen and Marc,
> 
> >>>>>
> 
> >>>>> Can you share the PDF (feel free to email my private email) that fails 
> >>>>> in
> 
> >>>>> Solr?
> 
> >>>>>
> 
> >>>>> Thanks,
> 
> >>>>> Grant
> 
> >>>>>
> 
> >>>>>
> 
> >>>>> On Apr 30, 2010, at 7:55 AM, Marc Ghorayeb wrote:
> 
> >>>>>
> 
> >>>>>>
> 
> >>>>>> Hi
> 
> >>>>>> Nope i didn't get it to work... Just like you, command line version of
> 
> >>>>> tika extracts correctly the content, but once included in Solr, no 
> >>>>> content
> 
> >>>>> is extracted.
> 
> >>>>>> What i tried until now is:- Updating the tika libraries inside Solr 1.4
> 
> >>>>> public version, no luck there.- Downloading the latest SVN version, 
> >>>>> compiled
> 
> >>>>> it, and started from a simple schema, still no luck.- Getting other 
> >>>>> versions
> 
> >>>>> compiled on hudson (nightly builds), and testing them also, still no
> 
> >>>>> extraction.
> 
> >>>>>> I sent a mail on the developpers mailing list but they told me i should
> 
> >>>>> just mail here, hope some developper reads this because it's quite an
> 
> >>>>> important feature of Solr and somehow it got broke between the 1.4 
> >>>>> release,
> 
> >>>>> and the last version on the svn.
> 
> >>>>>> Marc
> 
> >>>>>> _________________________________________________________________
> 
> >>>>>> Consultez gratuitement vos emails Orange, Gmail, Free, ... directement
> 
> >>>>> dans HOTMAIL !
> 
> >>>>>> http://www.windowslive.fr/hotmail/agregation/
> 
> >>>>>
> 
> >>>>> --------------------------
> 
> >>>>> Grant Ingersoll
> 
> >>>>> http://www.lucidimagination.com/
> 
> >>>>>
> 
> >>>>> Search the Lucene ecosystem using Solr/Lucene:
> 
> >>>>> http://www.lucidimagination.com/search
> 
> >>>>>
> 
> >>>>>
> 
> >>>
> 
> >>> _________________________________________________________________
> 
> >>> Hotmail et MSN dans la poche? HOTMAIL et MSN sont dispo gratuitement sur 
> >>> votre téléphone!
> 
> >>> http://www.messengersurvotremobile.com/?d=Hotmail
> 
> >>
> 
> >> --------------------------
> 
> >> Grant Ingersoll
> 
> >> http://www.lucidimagination.com/
> 
> >>
> 
> >> Search the Lucene ecosystem using Solr/Lucene: 
> >> http://www.lucidimagination.com/search
> 
> >>
> 
> >
> 
> > --------------------------
> 
> > Grant Ingersoll
> 
> > http://www.lucidimagination.com/
> 
> >
> 
> > Search the Lucene ecosystem using Solr/Lucene: 
> > http://www.lucidimagination.com/search
> 
> >
> 
> 
> 
> --------------------------
> 
> Grant Ingersoll
> 
> http://www.lucidimagination.com/
> 
> 
> 
> Search the Lucene ecosystem using Solr/Lucene: 
> http://www.lucidimagination.com/search
> 
> 
                                          
_________________________________________________________________
Hotmail arrive sur votre téléphone ! Compatible Iphone, Windows Phone, 
Blackberry, …
http://www.messengersurvotremobile.com/?d=Hotmail

Reply via email to