Solr uses Apache Tika for text extraction. Their mailing list (and issues list) might be better place to resolve this. And if it is a bug, they probably would appreciate an example they could practice on.
Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Sun, Jul 21, 2013 at 7:01 AM, Hiroshi Tatsumi < honekich...@comet.ocn.ne.jp> wrote: > Hi, > > I am using Solr 4.3.0. > I'd like to index autoshape text in Excel 2007+(.xlsx) by using > ExtractingRequestHandler, but I can't. > > I tried to do for some MS office files. > The results are below. > > Success (I can index autoshape text.) > - Excel 2003(.xls) > - Word 2003(.doc) > - Word 2007+(.docx) > > Failed (I cannot index autoshape text.) > - Excel 2007+(.xlsx) > > Is this a bug? > If you know, could you tell me how to index autoshape text in Excel 2007+? > > Thanks, > Hiro. >