files not indexed
Date: Mon, 21 Oct 2013 07:59:20 GMT
Hi Otis,
In our case, neither Tika nor Solr raises an exception. A Lucene document is
created, but its content field contains only a few whitespace characters, as
happens with ODF files.
Roland.
On Sat, Oct 19, 2013 at 3:54 AM, Otis Gospodnetic <
otis.gospodne...@gmail.com> wrote:
Hi Roland,
It looks like:
Tika - yes
Solr - no?
Based on http://search-lucene.com/?q=xlsb
ODF != XLSB though, I think...
Otis
--
Solr & ElasticSearch Support -- http://sematext.com/
Performance Monitoring -- http://sematext.com/spm
On Fri, Oct 18, 2013 at 7:36 AM, Roland Everaert wrote:
Hi,
Can someone tell me whether Tika is supposed to extract data from XLSB files
(the new MS Office format in binary form)?
If so, it seems that Solr is not able to index them, just as it is not able
to index ODF files (a JIRA issue is already open for ODF:
https://issues.apache.org/jira/browse/SOLR-4809).
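
The symptom discussed in this thread (a document is created, but its content field holds only whitespace) can be checked after indexing by fetching the document back from Solr and inspecting the field. The following is only a minimal sketch, not something posted by the thread participants. It assumes the extracted text is stored in a field named `content` with `id` as the unique key, and the core URL passed to `fetch_doc` is a placeholder for your own setup. The helper names `is_blank_content` and `fetch_doc` are hypothetical.

```python
import json
import urllib.parse
import urllib.request


def is_blank_content(doc, field="content"):
    """Return True if the field is missing or holds only whitespace.

    `field` defaults to "content", which is an assumption about the schema.
    """
    value = doc.get(field)
    if value is None:
        return True
    # Multi-valued Solr fields come back as lists; normalise to a list.
    values = value if isinstance(value, list) else [value]
    return all(not str(v).strip() for v in values)


def fetch_doc(core_url, doc_id):
    """Fetch one document by id through Solr's select handler.

    `core_url` is a placeholder such as "http://localhost:8983/solr/mycore";
    the unique key is assumed to be named "id" and to contain no quotes.
    """
    params = urllib.parse.urlencode({"q": 'id:"%s"' % doc_id, "wt": "json"})
    with urllib.request.urlopen(core_url + "/select?" + params) as resp:
        docs = json.load(resp)["response"]["docs"]
    return docs[0] if docs else None
```

For example, `is_blank_content(fetch_doc(core_url, "report.xlsb"))` returning True, with no exception in the Solr log, would match the behaviour Roland describes. It would not by itself show whether the text was lost in Tika or in Solr.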