Take a look at TikaEntityProcessor or the Tika package. I'm on restricted inet access so can't look at the exact class.
Erick On May 24, 2011 6:45 AM, "Thumuluri, Sai" <sai.thumul...@verizonwireless.com> wrote: > Good morning, I am trying to index some PDFs which are protected by > siteminder, any ideas as to how I can go about it? I am using Solr 1.4 >