Hi,

I am trying to index various binary file types into Solr. However, some
file types seem to be ignored and are not indexed, even though the metadata
is extracted successfully for all of them.

Specifically, zip and jpg files are not getting indexed, whereas pdf and
MS Office documents are. So I am wondering whether there is a defined list
of indexable file types.

Moreover, why could Solr not index the jpg and zip documents when it was
able to extract the metadata from those files?

The code snippet is as below:

contentStreamUpdateReq.addFile(file, fileType);
contentStreamUpdateReq.setParam("literal.id", literalId);
contentStreamUpdateReq.setParam("uprefix", "attr_");
contentStreamUpdateReq.setParam("fmap.content", "content");
contentStreamUpdateReq.setAction(AbstractUpdateRequest.ACTION.COMMIT, true, true);
solrServer.request(contentStreamUpdateReq);
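
To narrow this down, one option might be to send the same parameters with
extractOnly=true, which as far as I understand from the Solr Cell
documentation returns the extracted content without indexing it. That would
show whether any body text comes back for the jpg and zip files. Below is a
minimal sketch that only builds the request URL using the plain JDK. The
host, the /update/extract handler path, the class name and the "doc1" id are
my assumptions for illustration and are not taken from my actual setup.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class ExtractUrlSketch {

    // Builds the query string for an extract request. The first three
    // parameters mirror the snippet above. When extractOnly is true the
    // request asks Solr to return the extracted content instead of
    // indexing it; otherwise a commit is requested.
    public static String buildQuery(String literalId, boolean extractOnly) {
        Map<String, String> params = new LinkedHashMap<>();
        params.put("literal.id", literalId);
        params.put("uprefix", "attr_");
        params.put("fmap.content", "content");
        if (extractOnly) {
            params.put("extractOnly", "true");
        } else {
            params.put("commit", "true");
        }

        // URL-encode each key and value and join them with '&'.
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : params.entrySet()) {
            if (sb.length() > 0) {
                sb.append('&');
            }
            sb.append(URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8))
              .append('=')
              .append(URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed base URL; adjust host, port and core for a real setup.
        String base = "http://localhost:8983/solr/update/extract";
        System.out.println(base + "?" + buildQuery("doc1", true));
    }
}
```

The file itself would still have to be posted as the request body to that
URL. Comparing the response for a pdf with the response for a jpg or zip
should show whether the content is empty or just mapped to a different field.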

Thanks & Regards
Vijay

