org.apache.pdfbox.pdmodel.PDPage Error

2011-10-20 Thread MBD
Hi, I'm new to Solr and trying to get it to index PDFs. Having trouble getting started. Following examples in ExtractingRequestHandler wiki . Got Solr running and it indexes html, xml & txt files just fine...but when I try to feed it a .pdf

Re: org.apache.pdfbox.pdmodel.PDPage Error

2011-10-24 Thread MBD
va expert...but would like to get this stabilized...if possible. If this is the wrong mailing list then just tell me and I'll go away... Thanks! On Oct 20, 2011, at 2:54 PM, MBD wrote: > Hi, I'm new to Solr and trying to get it to index PDFs. Having trouble > getting start

Setting up Solr for first time

2011-11-02 Thread MBD
Looking for help getting a basic (the example) configuration up and stabilized so we can start experimenting with it. Requirement being that it index PDFs. After basic install Solr (3.4) is indexing raw text/html files. But when feeding in a PDF I'm getting a permissions error but not sure how t