Re: solr cell: write entire file content binary to index along with metadata

2018-04-25 Thread Rahul Singh
Lucene ( the major underlying Tech in SolR ) can handle any data, but it’s optimized to be an index , not a file store. Better to put that in another DB or file system like Cassandra, S3, etc. (better than SolR). In our experience , leveraging the tika binary / microservice as a pre-index proce

Re: solr cell: write entire file content binary to index along with metadata

2018-04-25 Thread Shawn Heisey
On 4/25/2018 4:02 AM, Lee Carroll wrote: *We don't recommend using solr-cell for production indexing.* Ok. Are the reasons for: Performance. I think we have rather modest index requirement (1000 a day... on a busy day) Security. The index workflow is, upload files to public facing server w

Re: solr cell: write entire file content binary to index along with metadata

2018-04-25 Thread Lee Carroll
> > > > > *That's not usually the kind of information you want to have in a > Solrindex. Most of the time, there will be an entry in the Solr index > thattells the system making queries how to locate the actual data -- > afilename, a URL, a database lookup key, etc.* Agreed. The app will have a

Re: solr cell: write entire file content binary to index along with metadata

2018-04-24 Thread Shawn Heisey
On 4/24/2018 10:26 AM, Lee Carroll wrote: > Does the solr cell contrib give access to the files raw content along with > the extracted metadata?\ That's not usually the kind of information you want to have in a Solr index.  Most of the time, there will be an entry in the Solr index that tells the

solr cell: write entire file content binary to index along with metadata

2018-04-24 Thread Lee Carroll
Does the solr cell contrib give access to the files raw content along with the extracted metadata? cheers Lee C