On 06/12/2017 10:10, Gora Mohanty wrote:
On 6 December 2017 at 10:39, Munish Kumar Arora
<munisharora....@gmail.com> wrote:

So the questions are:
1. Can I get the PDF content?
2. Does Solr store the actual file somewhere?
   a. If it does, where?
   b. If it does not, is there a way to store THE FILE?

Normal practice would be to store the PDF file somewhere on the file
system where it can be served through an HTTP request. Then, store the
filesystem path to the PDF file in Solr so that it can be returned in
Solr search results.

Regards,
Gora

Yes, you *can* store the entire contents of an indexed file in Solr. No, you really, really shouldn't. Always make sure you can regenerate your index from the original sources if you need to - a search engine is not a database.

I'll just write that again: a search engine is not a database.

The method described above is the usual way to deal with this situation.
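
For illustration, here is a minimal Python sketch of that approach. It is only one way to do it, under assumptions that are not from this thread: the helper names (store_pdf, build_update_request), the storage directory, the base URL, and the field names are all hypothetical. The *_s field names assume a schema with a string dynamic field, as in Solr's default configset, and the update URL assumes Solr's JSON update handler at /solr/<core>/update. The sketch copies the PDF to a directory that a web server could serve, then builds a Solr document that holds only the path and URL, not the file itself.

```python
import hashlib
import json
import shutil
import urllib.request
from pathlib import Path


def store_pdf(src: Path, store_dir: Path, base_url: str) -> dict:
    """Copy the PDF into store_dir and return a Solr document pointing at it.

    All field names here are hypothetical; adjust them to your schema.
    """
    store_dir.mkdir(parents=True, exist_ok=True)
    # Use a content hash as the id so re-indexing the same file
    # overwrites the existing Solr document instead of duplicating it.
    doc_id = hashlib.sha1(src.read_bytes()).hexdigest()
    dest = store_dir / f"{doc_id}.pdf"
    shutil.copyfile(src, dest)
    return {
        "id": doc_id,
        "original_name_s": src.name,
        "file_path_s": str(dest),
        "url_s": f"{base_url.rstrip('/')}/{dest.name}",
    }


def build_update_request(solr_update_url: str, docs: list) -> urllib.request.Request:
    """Build (but do not send) a JSON update request for Solr."""
    return urllib.request.Request(
        solr_update_url,
        data=json.dumps(docs).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Sending the request with urllib.request.urlopen (for example against http://localhost:8983/solr/mycore/update?commit=true, where the host and core name are placeholders) would index the document. Because the original files stay on disk, the index can be rebuilt from them at any time.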

Best

Charlie
--
Charlie Hull
Flax - Open Source Enterprise Search

tel/fax: +44 (0)8700 118334
mobile:  +44 (0)7767 825828
web: www.flax.co.uk
