Get content in response from ExtractingRequestHandler

trung.ht Thu, 09 Jul 2015 19:55:18 -0700

Hi everyone,

I use Solr to index and search Office files (docx, pptx, ...). To reduce
the size of the Solr index, I do not store the file content in Solr.
However, my customer now wants to preview the content of the files.


I have read the documentation for ExtractingRequestHandler, but it seems
that the only way to get the content back in the response from Solr is to
set extractOnly=true, and in that case Solr does not index the file.
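
To make the two cases concrete, here is a rough sketch of the request URLs
I am comparing. The host, the core name "docs", and the document id "doc1"
are placeholders I made up for illustration; only the request parameters
(extractOnly, extractFormat, literal.id, commit, wt) are meant literally.

```python
from urllib.parse import urlencode

# Assumed placeholder: a local Solr instance with a core named "docs".
SOLR_EXTRACT_URL = "http://localhost:8983/solr/docs/update/extract"


def extract_only_url(base=SOLR_EXTRACT_URL):
    """URL for a request that returns the extracted content but does not index."""
    params = {"extractOnly": "true", "extractFormat": "text", "wt": "json"}
    return base + "?" + urlencode(params)


def index_url(doc_id, base=SOLR_EXTRACT_URL):
    """URL for a request that indexes the file; no content comes back."""
    params = {"literal.id": doc_id, "commit": "true", "wt": "json"}
    return base + "?" + urlencode(params)


if __name__ == "__main__":
    # The file itself would be sent as the POST body in both cases.
    print(extract_only_url())
    print(index_url("doc1"))
```

With the first URL I get the text but nothing is indexed; with the second
the file is indexed but I never see the text.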

My question is: is there any way for Solr to extract the content with Tika,
index the content (without storing it), and then give me the content in the
response?

Thanks in advance, and sorry if my explanation is confusing.

Trung.
