Hi Mike, how many docs and groups do you have in your index?
I think the group.sort option fits your requirements.

If I remember correctly group.ngroup=true adds something like 30% extra time
on top of the search request with grouping,
but that was on my local test dataset (~30M docs, ~8000 groups)  and my
machine. You might encounter different search times when setting
group.ngroup=true.

Martijn

2011/8/1 Mike Sokolov <soko...@ifactory.com>

> Thanks, Tomas.  Yes we are planning to keep a "current" flag in the most
> current document.  But there are cases where, for a given user, the most
> current document is not that one, because they only have access to some
> older documents.
>
> I took a look at 
> http://wiki.apache.org/solr/**FieldCollapsing<http://wiki.apache.org/solr/FieldCollapsing>and
>  it seems as if it will do what we need here.  My one concern is that it
> might not be efficient at computing group.ngroups for a very large number of
> groups, which we would ideally want.  Is that something I should be worried
> about?
>
> -Mike
>
>
> On 08/01/2011 10:08 AM, Tomás Fernández Löbbe wrote:
>
>> Hi Michael, I guess this could be solved using grouping as you said.
>> Documents inside a group can be sorted on a field (in your case, the
>> version
>> field, see parameter group.sort), and you can show only the first one. It
>> will be more complex to show facets (post grouping faceting is work in
>> progress but still not committed to the trunk).
>>
>> I would be easier from the Solr side if you could do something at index
>> time, like indicating which document is the "current" one and which one is
>> an old one (you would need to update the old document whenever a new
>> version
>> is indexed).
>>
>> Regards,
>>
>> Tomás
>>
>> On Mon, Aug 1, 2011 at 10:47 AM, Mike Sokolov<soko...@ifactory.com>
>>  wrote:
>>
>>
>>
>>> A customer has an interesting problem: some documents will have multiple
>>> versions. In search results, only the most recent version of a given
>>> document should be shown. The trick is that each user has access to a
>>> different set of document versions, and each user should see only the
>>> most
>>> recent version of a document that they have access to.
>>>
>>> Is this something that can reasonably be solved with grouping?  In 3.x? I
>>> haven't followed the grouping discussions closely: would someone point me
>>> in
>>> the right direction please?
>>>
>>> --
>>> Michael Sokolov
>>> Engineering Director
>>> www.ifactory.com
>>>
>>>
>>>
>>>
>>
>>
>


-- 
Met vriendelijke groet,

Martijn van Groningen

Reply via email to