Hello,

Take a look at Apache Tika. It can handle these MS Word file formats:

http://tika.apache.org/1.3/formats.html#Microsoft_Office_document_formats

Solr Wiki:

http://wiki.apache.org/solr/ExtractingRequestHandler

I don't have a link for a tutorial with example schemas.
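
In lieu of a tutorial, here is a rough sketch of how the whole folder could be
pushed through the ExtractingRequestHandler in one run. It assumes the handler
is mapped to /update/extract (as in the example solrconfig.xml), that Solr
listens on localhost:8983, and that the file name is acceptable as the unique
id. The function names, the URL, and the choice to send the files as
application/octet-stream and let Tika detect the type are my assumptions, not
something from the wiki page, so adjust them to your setup.

```python
import os
import urllib.parse
import urllib.request

# Assumption: default example host/port and handler path; change as needed.
SOLR_EXTRACT_URL = "http://localhost:8983/solr/update/extract"

WORD_EXTENSIONS = (".doc", ".docx")


def find_word_files(folder):
    """Return sorted paths of .doc/.docx files directly inside folder."""
    return sorted(
        os.path.join(folder, name)
        for name in os.listdir(folder)
        if name.lower().endswith(WORD_EXTENSIONS)
    )


def build_extract_url(doc_id, commit=False, base_url=SOLR_EXTRACT_URL):
    """Build the request URL; literal.id sets the document's id field."""
    params = {"literal.id": doc_id}
    if commit:
        params["commit"] = "true"
    return base_url + "?" + urllib.parse.urlencode(params)


def index_folder(folder, base_url=SOLR_EXTRACT_URL):
    """POST every Word file in folder to Solr, committing on the last one."""
    paths = find_word_files(folder)
    for i, path in enumerate(paths):
        is_last = i == len(paths) - 1
        # Assumption: the file name is unique enough to serve as the id.
        url = build_extract_url(
            os.path.basename(path), commit=is_last, base_url=base_url
        )
        with open(path, "rb") as f:
            request = urllib.request.Request(
                url,
                data=f.read(),
                headers={"Content-Type": "application/octet-stream"},
            )
        with urllib.request.urlopen(request) as response:
            response.read()  # drain the response; HTTP errors raise
    return len(paths)
```

Calling index_folder("/path/to/docs") would then send each file in turn. Your
schema still needs fields for whatever Tika extracts, or fmap/uprefix settings
on the handler to map them.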

Dmitry

On Tue, Mar 5, 2013 at 11:59 AM, anarchos78
<rigasathanasio...@hotmail.com> wrote:

> Hello,
>
> I have a folder that contains about 50 Word doc files. Is there a way to index
> them in one shot? My only indexing experience is with DIH.
> Is it possible to provide a link to a tutorial or info on how to do the
> above task (data-config and schema examples)?
>
> Many thanks in advance,
> Tom
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Bulk-word-document-indexing-tp4044794.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
