Hi Ken,

The patch has no option to skip collapsing on null field values.
However, if you want this behaviour, it is a really small change to
the patch itself. Assuming that you're using the default collapsing
algorithm, you can add the following piece of code to the
NonAdjacentDocumentCollapser.java file:
if (currentValue == null) {
   continue;
}

Place it in the doCollapsing method after the following statement:
String currentValue = values.lookup[values.order[currentId]];

This makes sure that documents that have no value in the collapse
field are not collapsed.
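
To illustrate the effect of that null check outside the patch, here is
a minimal, self-contained sketch. It is not the patch's code: the class
name, the collapse method and the plain String[] standing in for the
field cache are all made up for the example, and it simply keeps the
first document seen for each value.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for the collapser; not part of the SOLR-236 patch.
public class NullSkipCollapseSketch {

    // fieldValues[docId] is the collapse field value of that document,
    // or null when the document has no value in the field.
    // Returns the ids of the documents that remain after collapsing.
    static List<Integer> collapse(String[] fieldValues) {
        Set<String> seenValues = new HashSet<>();
        List<Integer> keptDocIds = new ArrayList<>();
        for (int currentId = 0; currentId < fieldValues.length; currentId++) {
            String currentValue = fieldValues[currentId];
            if (currentValue == null) {
                // Same idea as the check above: a document without a value
                // never takes part in collapsing, so it is always kept.
                keptDocIds.add(currentId);
                continue;
            }
            // Keep only the first document for each distinct value.
            if (seenValues.add(currentValue)) {
                keptDocIds.add(currentId);
            }
        }
        return keptDocIds;
    }

    public static void main(String[] args) {
        String[] values = {"a", null, "a", "b", null};
        System.out.println(collapse(values)); // prints [0, 1, 3, 4]
    }
}
```

Without the null check, the two documents with no value would be
treated as one group and the second of them would be collapsed away.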

Field collapsing has a big impact on your search times and, to a
lesser extent, on memory usage. It can make searches up to 10 times
slower, but this depends on the situation. As indexes get bigger this
becomes a bigger problem. Using field collapsing in a distributed
environment can also cause problems, because collapse information is
not shared between shards, which results in incorrect collapse
results. The only workaround I know of is to make sure that the groups
are distributed evenly between shards and that a group's documents are
not spread across shards.
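
One way to keep a group's documents together is to pick the target
shard from the collapse field value at indexing time. The sketch below
assumes that your indexing client decides which shard each document is
sent to; the class and method names are hypothetical and nothing here
is a Solr API. Hashing the value gives a roughly even spread only when
there are many groups of similar size.

```java
// Hypothetical client-side router; not part of Solr or the patch.
public class GroupShardRouterSketch {

    // Maps a collapse field value to a shard index in [0, numShards).
    // Equal values always map to the same shard, so a group is never
    // spread across shards.
    static int shardFor(String groupValue, int numShards) {
        if (numShards <= 0) {
            throw new IllegalArgumentException("numShards must be positive");
        }
        // floorMod keeps the result non-negative for negative hash codes.
        return Math.floorMod(groupValue.hashCode(), numShards);
    }

    public static void main(String[] args) {
        int numShards = 4;
        String[] groupValues = {"agent-17", "agent-42", "agent-17"};
        for (String value : groupValues) {
            System.out.println(value + " -> shard " + shardFor(value, numShards));
        }
    }
}
```

Note that changing the number of shards changes the mapping, so the
index would have to be rebuilt with the new shard count.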

Other than that, there are no further major issues with this patch.
Many people are using this patch in their Solr setups, but it is a
patch, so you'll have to keep that in mind. There are efforts to put
grouping functionality into Solr (without patching) in SOLR-236's
child issues, so keep an eye on those issues.

Cheers,

Martijn

On 3 July 2010 19:20, osocurious2 <ken.fos...@realestate.com> wrote:
>
> <boost>
>
> I wanted to extend my question some. My original question about collapsing
> null fields is still open, but in trying to research elsewhere I see a lot
> of angst about the Field Collapse functionality in general. Can anyone
> summarize what the current state of affairs is with it? I'm on Solr 1.4,
> just the latest release build, not any current builds. Field Collapse seems
> to be in my build because I could do single field collapse just fine (hence
> my null field question). However there seems to be talk of problems with
> Field Collapse that aren't fixed yet. What kinds of issues are people
> having? Should I avoid Field Collapse in a production app for now? (tricky
> because I'm merging my schema with a third party tool schema and they are
> using Field Collapse).
>
> Any insight would be helpful, thanks
> Ken
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Field-Collapse-question-tp939118p940923.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
