Hi You can look at the Apache Tika project or the PDFBox project to parse your files before sending to Solr. Alternatively, if your processing is very simple, you can use the built-in Tika as U just did, and then deploy some UpdateRequestProcessor’s in order to modify the Tika output into whatever fields you like.
-- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 16. mar. 2016 kl. 08.18 skrev vidya <vidya.nade...@tcs.com>: > > Sorry for conveying it in wrong way. I want my data of 1 pdf file to be > indexed with different fields in a document of solr according to data in it > like name;id;title;content etc > > Thanks > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/indexing-pdf-files-using-post-tool-tp4263811p4264052.html > Sent from the Solr - User mailing list archive at Nabble.com.