Re: Indexing bulk loads of PDF files and extracting information from them

2013-09-24 Thread Erick Erickson
Consider using a SolrJ program, perhaps multiple ones running in parallel. See: http://searchhub.org/dev/2012/02/14/indexing-with-solrj/ Best, Erick On Mon, Sep 23, 2013 at 3:31 PM, Sadika Amreen wrote: > Hi all, > > > > I am looking to index the entire directory of PDF files. We have a very la

Indexing bulk loads of PDF files and extracting information from them

2013-09-23 Thread Sadika Amreen
Hi all, I am looking to index the entire directory of PDF files. We have a very large volume of PDFs (3000+, possibly much more), so adding them manually would be cumbersome. I have seen more than a couple of dozen links explaining how to index PDF using SOLR, but none were details enough