Check to see if there are any errors in the Solr log for jpg and zip files.
Solr should do something for them - if not, file a Jira to suggest that it
should, as an imporvement. Zip should give a list of the enclosed files.
Images should at least give the metadata.
-- Jack Krupansky
On Wed, Apr 1
Thanks Andrea. For image files and zip files, even metadata is not
available. Just to explain further, I have indexed a total of 10 files, out
of which a .jpg file and .zip file are present.
After the indexing process is complete, no information about either of
these files is present in the solr q
Sorry, attachments are not supported here :(
Anyway, I believe the misunderstanding resides in what you think you
should mean "image indexing": actually, AFAIK, Tika indexes only a) the
textual content of a given resource b) its metadata.
So
- for a JPG file (or in genetal, an image) you will
Thanks Andrea. I can see that Tika1.5 supports both compressed (ZIP) and
image (JPG) formats. If thats the case, why SolrCell could not index the
documents of .zip and .jpg? Am I missing something here? No error is
thrown in the overall process and the java program completes successfully.
But when
Hi Vijay,
here you can find all supported formats by Tika, which is internally
used by SolrCell:
* https://tika.apache.org/*1.4*/formats.html
* https://tika.apache.org/*1.5*/formats.html
* https://tika.apache.org/*1.6*/formats.html
* https://tika.apache.org/*1.7*/formats.html
Best,
Andrea