I have a list of files in a database that I am indexing (it is a liferay
database and the file lists are attachments). I'm encountering the following
error

https://issues.apache.org/jira/browse/PDFBOX-709

on one of the PDF documents and this causes indexing to stop (the
TikaEntityProcessor) throws a Severe exception. Is it possible to ignore
this exception and continue indexing by some kind of solr configuration ?

It seems reasonable to do this in my case as I do not want indexing to stop
due to a non-critical error beyond my control. Currently I've modified the
TikaEntityProcessor to return null in this case. BTW shouldn't the
inputstream close be in a finally block?
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexing-stops-after-exception-tp867608p867608.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to