I am using solr 3.4 and configured my DataImportHandler to get some data from MySql as well as index some rich document from the disk.
This is the part of db-data-config file where i am indexing Rich text documents..... <entity name="resume" dataSource="ds-db" query="Select name,js_login_id div 25000 as dir from js_resumes where js_login_id='${js_logins.id}' and is_primary = 1 and deleted=0 and mask_cv != 1" pk="resume_name" deltaQuery="select js_login_id from js_resumes where modified > '${dataimporter.last_index_time}' and is_primary = 1 and deleted=0" parentDeltaQuery="select jsl.id as id from service_request_histories srh,service_requests sr, js_login_screenings jsls, js_logins jsl where jsl.status IN(1,2) and srh.service_request_id = sr.id and jsl.id=jsls.js_login_id and srh.status in ('8','43') and jsls.id=srh.sid and date(srh.created) < date_sub(now(),interval 2 day) and jsl.id = '${js_resumes.js_login_id}'" > <entity processor="TikaEntityProcessor" tikaConfig="tika-config.xml" url="http://localhost/resumes-new/resumes${resume.dir}/${js_logins.id}/${resume.name}" dataSource="ds-file" format="text"> <field column="text" name="resume" /> </entity> </entity> But after some time i get the following error in my error log. It looks like a class missing error, Can anyone tell me which poi jar version would work with tika.0.6. Currently I have poi-3.7.jar. Error which i am getting is this .... SEVERE: Exception while processing: js_logins document : SolrInputDocument[{id=id(1.0)={100984}, complete_mobile_number=complete_mobile_number(1.0)={+91 9600067575}, emailid=emailid(1.0)={vkry...@gmail.com}, full_name=full_name(1.0)={Venkat Ryali}}]:org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.NoSuchMethodError: org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(Lorg/openxmlformats/schemas/wordprocessingml/x2006/main/CTP;Lorg/apache/poi/xwpf/usermodel/XWPFDocument;)V at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:669) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:622) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:622) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:268) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:187) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:359) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:427) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:408) Caused by: java.lang.NoSuchMethodError: org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(Lorg/openxmlformats/schemas/wordprocessingml/x2006/main/CTP;Lorg/apache/poi/xwpf/usermodel/XWPFDocument;)V at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator$MyXWPFParagraph.<init>(XWPFWordExtractorDecorator.java:163) at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator$MyXWPFParagraph.<init>(XWPFWordExtractorDecorator.java:161) at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.extractTableContent(XWPFWordExtractorDecorator.java:140) at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.buildXHTML(XWPFWordExtractorDecorator.java:91) at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:69) at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:51) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101) at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:128) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:238) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:596) ... 7 more -- View this message in context: http://lucene.472066.n3.nabble.com/Rich-document-indexing-tp3512276p3512276.html Sent from the Solr - User mailing list archive at Nabble.com.