On Nov 29, 2010, at 5:17 PM, Shawn Heisey wrote:

> I was just in a meeting where we discussed customer feedback on our website.
> One thing that the users would like to see is "galleries" where photos that
> are part of a set are grouped together under a single result. This is
> basically field collapsing.
>
> The problem I've got is that for most of our content, there's nothing to tie
> different photos together in a coherent way other than similar language in
> fields like the caption. Is it feasible to use termvector information to
> automatically group documents with similar (but not identical) data in one or
> more fields?

I'd probably look at the ClusteringComponent as a way to get at what you are after. Either that or you need to somehow normalize the language going in.

-Grant
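
To make the idea in the question concrete, here is a minimal sketch of grouping captions by term-vector similarity. It is not how the ClusteringComponent works internally and it does not call Solr at all. The function names, the 0.5 threshold, and the greedy "first matching group wins" strategy are all assumptions chosen for illustration.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Build a lowercased term-frequency vector for one caption."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def group_captions(captions, threshold=0.5):
    """Greedily assign each caption to the first group whose
    representative (its first member) is similar enough.

    The threshold is a hypothetical value; real data needs tuning.
    Returns a list of groups, each a list of caption indices.
    """
    groups = []  # list of (representative vector, member indices)
    for i, caption in enumerate(captions):
        vec = term_vector(caption)
        for rep, members in groups:
            if cosine(rep, vec) >= threshold:
                members.append(i)
                break
        else:
            # No existing group was close enough, so start a new one.
            groups.append((vec, [i]))
    return [members for _, members in groups]


if __name__ == "__main__":
    # Hypothetical captions, not taken from any real index.
    captions = [
        "President speaks at the White House press conference",
        "The President speaks during a White House press conference",
        "Fans cheer at the football stadium",
    ]
    print(group_captions(captions))
```

With these three made-up captions, the first two land in one group and the third stands alone. Raw term counts are sensitive to wording differences and common words, which is where normalizing the language going in (stopword removal, stemming, and so on) would matter.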