Re: How to index autoshape text in Excel 2007+

Alexandre Rafalovitch Sun, 21 Jul 2013 05:54:29 -0700

Solr uses Apache Tika for text extraction. The Tika mailing list (and issue
tracker) might be a better place to resolve this. And if it is a bug, they
would probably appreciate an example file they can reproduce it with.
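
When preparing such an example, it may help to first confirm that the
autoshape text really is stored in the .xlsx file. The following is only a
rough sketch using the Python standard library, not anything Solr or Tika
does internally. It assumes the shapes are stored as DrawingML parts under
xl/drawings/ with their text in <a:t> elements, which is the usual OOXML
layout; the function name autoshape_text and the file name sample.xlsx are
made up for illustration.

```python
import zipfile
import xml.etree.ElementTree as ET

# DrawingML main namespace; shape text runs live in <a:t> elements.
A_NS = "http://schemas.openxmlformats.org/drawingml/2006/main"


def autoshape_text(xlsx):
    """Return {drawing part name: [text runs]} for an .xlsx path or file object.

    Assumption: shape text sits in xl/drawings/*.xml parts. Legacy VML
    drawings (*.vml) and relationship files (*.rels) are skipped.
    """
    found = {}
    with zipfile.ZipFile(xlsx) as zf:
        for name in sorted(zf.namelist()):
            if name.startswith("xl/drawings/") and name.endswith(".xml"):
                root = ET.fromstring(zf.read(name))
                runs = [t.text for t in root.iter("{%s}t" % A_NS) if t.text]
                if runs:
                    found[name] = runs
    return found


# Hypothetical usage:
#   print(autoshape_text("sample.xlsx"))
```

If this prints the shape text but Solr does not index it, the text is
present in the file and is being lost during extraction, which is useful
to state when reporting it.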


Regards,
   Alex.

Personal website: http://www.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)


On Sun, Jul 21, 2013 at 7:01 AM, Hiroshi Tatsumi <
honekich...@comet.ocn.ne.jp> wrote:

> Hi,
>
> I am using Solr 4.3.0.
> I'd like to index autoshape text in Excel 2007+ (.xlsx) files by using
> ExtractingRequestHandler, but I can't.
>
> I tried this with several MS Office file types.
> The results are below.
>
> Success (I can index autoshape text.)
> - Excel 2003(.xls)
> - Word 2003(.doc)
> - Word 2007+(.docx)
>
> Failed (I cannot index autoshape text.)
> - Excel 2007+(.xlsx)
>
> Is this a bug?
> If you know, could you tell me how to index autoshape text in Excel 2007+?
>
> Thanks,
> Hiro.
>
