Hi everyone, I use Solr to index and search Office files (docx, pptx, ...). To keep the Solr index small, I do not store the file content in Solr. However, my customer now wants to preview the content of each file.
I have read the documentation for ExtractingRequestHandler, and it seems the only way to get the extracted content back in the Solr response is to set extractOnly=true, but in that case Solr does not index the file. My question is: is there any way for Solr to extract the content with Tika, index it (without storing it), and also return the content in the response? Thanks in advance, and sorry if my explanation is confusing. Trung.
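P.S. To make the two behaviours concrete, here is a rough sketch of the two requests I am comparing. It only builds the request URLs and sends nothing. The host, the core name `mycore`, and the helper names are assumptions for illustration; `extractOnly`, `extractFormat`, `literal.id`, `commit`, and `wt` are the handler and request parameters as I understand them from the documentation.

```python
from urllib.parse import urlencode

# Hypothetical host and core name; adjust to the real installation.
SOLR_EXTRACT_URL = "http://localhost:8983/solr/mycore/update/extract"


def preview_url(base=SOLR_EXTRACT_URL):
    """URL for a request that returns the Tika-extracted text without indexing."""
    params = {"extractOnly": "true", "extractFormat": "text", "wt": "json"}
    return base + "?" + urlencode(params)


def index_url(doc_id, base=SOLR_EXTRACT_URL):
    """URL for a request that indexes the file; the content is not returned."""
    params = {"literal.id": doc_id, "commit": "true"}
    return base + "?" + urlencode(params)


if __name__ == "__main__":
    # The file itself would be POSTed as the request body to either URL.
    print(preview_url())
    print(index_url("doc-1"))
```

As far as I can tell, the first request gives me the text but indexes nothing, and the second indexes the file but returns no content, which is why I am asking whether both can happen in one call.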