The current patch definitely supports facet before and after the collapsing.
Stephen Weiss wrote:
I just noticed this and it reminded me of an issue I've had with
collapsed faceting with an older version of the patch in Solr 1.3.
Would it be possible, if we can get the terms for all the collapsed
documents on a field, to then facet each collapsed document on the
unique terms it has collectively? What I mean is for example:
Doc 1, 2, 3 collapse together on some other field
Doc 1 is the "main document" and has the "colors" blue and red
Doc 2 has red
Doc 3 has green
For the purposes of faceting, it would be ideal in our case for
faceting on color to count one each for blue, red, and green on this
document (the user drills down on this value to yet another collapsed
set). Right now, when you facet after collapse you just get blue and
red (green is dropped because it collapses out). To the user it makes
the counts seem inaccurate, like they're missing something. Instead
we facet before collapsing and get an "inflated" value (which ticks 2
for red - but when you drill down, you still only get 1 because Doc 1
and Doc 2 collapse together again). Either way it's not ideal.
At the time (many months ago) there was no way to account for this but
it sounds like this patch could make it possible, maybe.
Thanks!
--
Steve
On Sep 5, 2009, at 5:57 AM, Uri Boness wrote:
There's work on the patch that is being done now which will enable
you to ask for specific field values of the collapsed documents using
a dedicated request parameter. This work is not committed yet to the
latest patch, but will be very soon. There is of course a drawback to
that as well, the collapsed documents set can be very large (depends
on your data of course) in which case the returned result which
includes the fields values can be rather large, which will impact
performance, this is why this feature will be enabled only if you
specify this extra parameter - by default no field values will be
returned.
AFAIK, the latest patch should work fine with the latest build.
Martijn (which is the main maintainer of this patch) tries to keep it
up to date with the latest builds. But I guess the safest way is to
work with the nightly build of the same date as the latest patch
(though I would give it a try first with the latest build).
BTW, it's not an official suggestion from the Solr development team,
but if you ask me, if you have to choose now whether to use 1.3 or
1.4-dev, I would go for the later. 1.4 is supposed to be released in
the upcoming week or two and it bring loads of bug fixes,
enhancements and extra functionality. But again, this is my personal
suggestion.
cheers,
Uri