Tom:

I thin my comments were that grouping on a field where there was
a unique value _per document_ chewed up a lot of resources.
Conceptually, there's a bucket for each unique group value. And
grouping on a file path is just asking for trouble.

But the memory used for grouping should max as a function of
the unique values in the grouped field.

Best
Erick

On Wed, Aug 22, 2012 at 11:32 PM, Lance Norskog <goks...@gmail.com> wrote:
> Yes, distributed grouping works, but grouping takes a lot of
> resources. If you can avoid in distributed mode, so much the better.
>
> On Wed, Aug 22, 2012 at 3:35 PM, Tom Burton-West <tburt...@umich.edu> wrote:
>> Thanks Tirthankar,
>>
>> So the issue in memory use for sorting.  I'm not sure I understand how
>> sorting of grouping fields  is involved with the defaults and field
>> collapsing, since the default sorts by relevance not grouping field.  On
>> the other hand I don't know much about how field collapsing is implemented.
>>
>> So far the few tests I've made haven't revealed any memory problems.  We
>> are using very small string fields for grouping and I think that we
>> probably only have a couple of cases where we are grouping more than a few
>> thousand docs.   I will try to find a query with a lot of docs per group
>> and take a look at the memory use using JConsole.
>>
>> Tom
>>
>>
>> On Wed, Aug 22, 2012 at 4:02 PM, Tirthankar Chatterjee <
>> tchatter...@commvault.com> wrote:
>>
>>>  Hi Tom,****
>>>
>>> We had an issue where we are keeping millions of docs in a single node and
>>> we were trying to group them on a string field which is nothing but full
>>> file path… that caused SOLR to go out of memory…****
>>>
>>> ** **
>>>
>>> Erick has explained nicely in the thread as to why it won’t work and I had
>>> to find another way of architecting it. ****
>>>
>>> ** **
>>>
>>> How do you think this is different in your case. If you want to group by a
>>> string field with thousands of similar entries I am guessing you will face
>>> the same issue. ****
>>>
>>> ** **
>>>
>>> Thanks,****
>>>
>>> Tirthankar****
>>> ***************************Legal Disclaimer***************************
>>> "This communication may contain confidential and privileged material for
>>> the
>>> sole use of the intended recipient. Any unauthorized review, use or
>>> distribution
>>> by others is strictly prohibited. If you have received the message in
>>> error,
>>> please advise the sender by reply email and delete the message. Thank you."
>>> **********************************************************************
>>>
>
>
>
> --
> Lance Norskog
> goks...@gmail.com

Reply via email to