Re: indexing pdf files using post tool

Jan Høydahl Fri, 18 Mar 2016 21:46:58 -0700

Hi

You can look at the Apache Tika project or the PDFBox project to parse your 
files yourself before sending them to Solr.
Alternatively, if your processing is very simple, you can use the built-in Tika 
as you just did, and
then deploy some UpdateRequestProcessors to map the Tika output 
into whatever fields you like.
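
As a rough sketch of the first option (parse first, then send), the snippet below takes text that has already been extracted from a PDF and splits it into the name, id, title and content fields mentioned in the question. The "Label: value" layout, the sample text and the helper name text_to_solr_doc are assumptions for illustration only; the real PDFs will likely need different rules, and the extraction step itself (Tika or PDFBox) is not shown.

```python
import json
import re

# Assumed layout: lines such as "Name: ...", "Id: ...", "Title: ..." (hypothetical).
FIELD_PATTERN = re.compile(r"^(name|id|title)\s*:\s*(.+)$", re.IGNORECASE)


def text_to_solr_doc(text):
    """Split extracted PDF text into labelled fields plus a content body."""
    doc = {}
    body = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        match = FIELD_PATTERN.match(line)
        if match and match.group(1).lower() not in doc:
            # First occurrence of a known label becomes its own field.
            doc[match.group(1).lower()] = match.group(2).strip()
        elif line:
            # Everything else is collected as free-text content.
            body.append(line)
    doc["content"] = " ".join(body)
    return doc


# Made-up sample standing in for text extracted from one PDF.
sample = """Name: Jane Doe
Id: 42
Title: Quarterly report
Revenue grew in the first quarter.
Costs stayed flat."""

doc = text_to_solr_doc(sample)
# A JSON array of documents, ready to be posted to Solr's update handler.
payload = json.dumps([doc])
print(payload)
```

The resulting JSON can then be sent to the collection's update handler with whatever client you already use. The field names must exist in your schema.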


--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com

> On 16 Mar 2016, at 08:18, vidya <vidya.nade...@tcs.com> wrote:
> 
> Sorry for conveying it the wrong way. I want the data from one PDF file to be
> indexed into different fields of a Solr document according to the data in it,
> such as name, id, title, content, etc.
> 
> Thanks 
> 
> 
> 
