On Jul 14, 2009, at 8:00 AM, Kevin Miller wrote:

I need to index primarily .doc files, but also .pdf and .xls files.
I am currently looking at the Tika project for this functionality.

This is now built into trunk (aka Solr 1.4): 
http://wiki.apache.org/solr/ExtractingRequestHandler
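
As a rough illustration of what a request to that handler might look like, here is a minimal Python sketch. It assumes a Solr instance at http://localhost:8983/solr with the handler mapped to /update/extract, as in the example configuration; the helper names, the document id, and the file name are hypothetical, and the content type is only a hint for the .doc case.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_extract_url(solr_base, doc_id, commit=True):
    """Build the URL for an extract request (hypothetical helper).

    Assumes the handler is mapped to /update/extract; check solrconfig.xml.
    """
    params = {"literal.id": doc_id}  # literal.<field> sets a field value directly
    if commit:
        params["commit"] = "true"
    return solr_base.rstrip("/") + "/update/extract?" + urlencode(params)


def build_extract_request(solr_base, doc_id, payload,
                          content_type="application/msword"):
    """Wrap the raw file bytes in a POST request (hypothetical helper)."""
    url = build_extract_url(solr_base, doc_id)
    return Request(url, data=payload, headers={"Content-Type": content_type})


def index_file(solr_base, doc_id, path, content_type="application/msword"):
    """Send one file to Solr; needs a running Solr with the handler enabled."""
    with open(path, "rb") as fh:
        request = build_extract_request(solr_base, doc_id, fh.read(), content_type)
    with urlopen(request) as response:
        return response.status


if __name__ == "__main__":
    # Assumed local setup; "report.doc" is a placeholder file name.
    print(index_file("http://localhost:8983/solr", "doc1", "report.doc"))
```

For .pdf or .xls files the same call should work with a different content_type (for example application/pdf), since Tika does the format-specific parsing.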

        Erik
