On Jul 14, 2009, at 8:00 AM, Kevin Miller wrote:
I am needing to index primarily .doc files but also need it to look at .pdf and .xls files. I am currently looking at the Tika project for this functionality.
This is now built into trunk (aka Solr 1.4): http://wiki.apache.org/solr/ExtractingRequestHandler Erik