Hello,

Take a look at Apache Tika. It can handle these MS Word file formats:
http://tika.apache.org/1.3/formats.html#Microsoft_Office_document_formats

Solr Wiki: http://wiki.apache.org/solr/ExtractingRequestHandler

I don't have a link to a tutorial with example schemas.

Dmitry

On Tue, Mar 5, 2013 at 11:59 AM, anarchos78 <rigasathanasio...@hotmail.com> wrote:

> Hello,
>
> I have a folder containing about 50 Word doc files. Is there a way to index
> them in one shot? The only experience that I have with indexing is with DIH.
> Is it possible to provide a link to a tutorial or info on how to do the
> above task (data-config and schema examples)?
>
> Many thanks in advance,
> Tom
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Bulk-word-document-indexing-tp4044794.html
> Sent from the Solr - User mailing list archive at Nabble.com.
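
To make the ExtractingRequestHandler suggestion concrete, here is a minimal sketch of indexing a whole folder in one run by POSTing each file to Solr. It is an illustration only, under several assumptions: the handler is mapped to /update/extract on a local Solr at the URL below, the unique key field is named id, and the file name is good enough as the document id. The function names are hypothetical, and the schema still needs fields for whatever Tika extracts.

```python
import urllib.parse
import urllib.request
from pathlib import Path

# Assumption: local Solr with the ExtractingRequestHandler at /update/extract.
SOLR_EXTRACT_URL = "http://localhost:8983/solr/update/extract"

WORD_SUFFIXES = {".doc", ".docx"}


def find_word_files(folder):
    """Return the Word documents directly inside `folder`, sorted by path."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in WORD_SUFFIXES
    )


def build_extract_url(doc_id, commit=False, base_url=SOLR_EXTRACT_URL):
    """Build the request URL; literal.id supplies the unique key value."""
    params = {"literal.id": doc_id}
    if commit:
        params["commit"] = "true"
    return base_url + "?" + urllib.parse.urlencode(params)


def index_folder(folder, base_url=SOLR_EXTRACT_URL):
    """POST every Word file in `folder` to Solr, committing with the last one."""
    files = find_word_files(folder)
    for i, path in enumerate(files):
        is_last = i == len(files) - 1
        # Assumption: the file name is unique in the folder and used as the id.
        url = build_extract_url(path.name, commit=is_last, base_url=base_url)
        request = urllib.request.Request(
            url,
            data=path.read_bytes(),  # supplying a body makes this a POST
            # Generic content type; Tika is expected to detect the real format.
            headers={"Content-Type": "application/octet-stream"},
        )
        with urllib.request.urlopen(request) as response:
            response.read()
    return len(files)
```

Calling index_folder("/path/to/word-docs") would then send all files in that (hypothetical) folder and return how many were posted. Committing only on the last request avoids a commit per document.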