Re: DIH Blob data

2014-11-14 Thread Anurag Sharma
Thanks Michael & Eric for the succinct response. On Sat, Nov 15, 2014 at 12:13 AM, Michael Sokolov < msoko...@safaribooksonline.com> wrote: > There is a binary type > > -Mike > > On 11/14/2014 12:21 PM, Anurag Sharma wrote: > >> bq: We routinely store images and pdfs in Solr. There *is* a benefit

Re: DIH Blob data

2014-11-14 Thread Erick Erickson
Right, a more nuanced comment involves what _type_ of docs you're storing, and what the ratio of searchable-to-overall size is. Consider an image. The searchable data may be 0.01% of the file size. Or even worse, a movie. As always, "it depends". I guess that personally I'm not a fan of using Solr

Re: DIH Blob data

2014-11-14 Thread Michael Sokolov
On 11/14/2014 01:43 PM, Erick Erickson wrote: Just skimming, so maybe I misinterpreted. ExternalFileField and ExternalFileFieldReloader refer to storing values for each doc in an external file, they have nothing to do with storing _files_. The usual pattern is to have Solr store just enough da

Re: DIH Blob data

2014-11-14 Thread Michael Sokolov
There is a binary type -Mike On 11/14/2014 12:21 PM, Anurag Sharma wrote: bq: We routinely store images and pdfs in Solr. There *is* a benefit, since you don't need to manage another storage system, you don't have to worry about Solr getting out of sync with the other system, you can use Solr r

Re: DIH Blob data

2014-11-14 Thread Erick Erickson
Just skimming, so maybe I misinterpreted. ExternalFileField and ExternalFileFieldReloader refer to storing values for each doc in an external file, they have nothing to do with storing _files_. The usual pattern is to have Solr store just enough data to have the system-of-record return the actual

Re: DIH Blob data

2014-11-14 Thread Anurag Sharma
bq: We routinely store images and pdfs in Solr. There *is* a benefit, since you don't need to manage another storage system, you don't have to worry about Solr getting out of sync with the other system, you can use Solr replication for all your assets, etc. Do the same holds good for large Blobs l

Re: DIH Blob data

2014-11-12 Thread Jeon Woosung
How about this? First, define a field for filter query. It should be multivalued. Second, implements transformer to extract json dynamic fields, and put the dynamic fields into the solr field. For example, Data : {a:1,b:2,c:3} You can split the data to "a:1", "b:2", "c:3", and put them into

Re: DIH Blob data

2014-11-12 Thread Michael Sokolov
We routinely store images and pdfs in Solr. There *is* a benefit, since you don't need to manage another storage system, you don't have to worry about Solr getting out of sync with the other system, you can use Solr replication for all your assets, etc. I don't use DIH, so personally I don't c

Re: DIH Blob data

2014-11-12 Thread Anurag Sharma
BLOB is non-searchable field so there is no benefit of storing it into Solr. Any external key-value store can be used to store the blob and reference of this blob can be stored as a string field in Solr. On Wed, Nov 12, 2014 at 5:56 PM, stockii wrote: > I had a similar problem and didnt find any

Re: DIH Blob data

2014-11-12 Thread stockii
I had a similar problem and didnt find any solution to use the fields in JSON Blob for a filter ... Not with DIH. -- View this message in context: http://lucene.472066.n3.nabble.com/DIH-Blob-data-tp4168896p4168925.html Sent from the Solr - User mailing list archive at Nabble.com.