Hi all,
I am using Solr 1.5 from trunk. I am getting the below error on a full
load, and it is causing the import to fail and rollback. I am not
concerned about the error but rather that I cannot seem to tell the
indexing to continue. I have two entities, and I have tried all (4)
combinations of "skip" and "continue" for their onError attributes.
SEVERE: Exception while processing: f document : null
org.apache.solr.handler.dataimport.DataImportHandlerException:
java.lang.NoClassDefFoundError:
org/bouncycastle/jce/provider/BouncyCastleProvider
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.j
ava:652)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.j
ava:606)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java
:261)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:18
5)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporte
r.java:333)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java
:391)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:
372)
Caused by: java.lang.NoClassDefFoundError:
org/bouncycastle/jce/provider/BouncyCastleProvider
at
org.apache.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:1108
)
at
org.apache.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:573)
at
org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:23
5)
at
org.apache.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:180)
at
org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:56)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:69)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101)
at
org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntit
yProcessor.java:124)
at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(Entity
ProcessorWrapper.java:233)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.j
ava:580)
... 6 more
Mar 2, 2010 10:21:05 PM org.apache.solr.handler.dataimport.DataImporter
doFullImport
SEVERE: Full Import failed
org.apache.solr.handler.dataimport.DataImportHandlerException:
java.lang.NoClassDefFoundError:
org/bouncycastle/jce/provider/BouncyCastleProvider
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.j
ava:652)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.j
ava:606)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java
:261)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:18
5)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporte
r.java:333)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java
:391)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:
372)
Caused by: java.lang.NoClassDefFoundError:
org/bouncycastle/jce/provider/BouncyCastleProvider
at
org.apache.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:1108
)
at
org.apache.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:573)
at
org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:23
5)
at
org.apache.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:180)
at
org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:56)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:69)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101)
at
org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntit
yProcessor.java:124)
at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(Entity
ProcessorWrapper.java:233)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.j
ava:580)
... 6 more
Mar 2, 2010 10:21:05 PM org.apache.solr.update.DirectUpdateHandler2
rollback
INFO: start rollback
My data-config file:
<dataConfig>
<dataSource name="binaryFile" type="BinFileDataSource" />
<document>
<entity name="f" processor="FileListEntityProcessor"
transformer="RegexTransformer,TemplateTransformer" baseDir="C:\Docs"
fileName=".*pdf" recursive="true" rootEntity="false" pk="id"
dataSource="binaryFile" onError="skip">
<field column="id" sourceColName="fileAbsolutePath" regex="\\"
replaceWith="/" />
<entity dataSource="binaryFile" name="x"
processor="TikaEntityProcessor" url="${f.fileAbsolutePath}"
onError="continue" >
<field column="text" name="text" />
</entity>
</entity>
</document>
</dataConfig>
Thanks,
Nirmal