I'm using the signature field grouping method, as described on this page:
https://cwiki.apache.org/confluence/display/solr/De-Duplication
You can set the signatureField to be "title", then during the query,
instead of using &group=true&group.field=title, you can use
&group=true&group.fiel
I second Toke’s recommendation to ensure you have a pure string-version of your
title.
For pure de-duplication you could also consider the lighter-weight
collapsing query parser (CollapsingQParserPlugin).
Instead of &group=true&group.field=title, use &fq={!collapse field=title_string}
See
https://cwiki.apache.org/confluence/d
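
To make the difference between the two request styles concrete, here is a minimal sketch that builds both query URLs. The host, the core name "videos", and the helper function names are assumptions made up for illustration; title_string is the pure string copy of the title mentioned above. The sketch only constructs URLs and does not contact a Solr server.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: host, port and core name are assumptions.
SOLR_SELECT = "http://localhost:8983/solr/videos/select"


def grouped_query(q, field="title_string"):
    """Build a request that uses result grouping on `field`."""
    params = {"q": q, "group": "true", "group.field": field}
    return SOLR_SELECT + "?" + urlencode(params)


def collapsed_query(q, field="title_string"):
    """Build a request that collapses duplicates with a filter query."""
    # The local-params syntax contains braces, '!' and a space, so it has
    # to be URL-encoded; urlencode takes care of that.
    params = {"q": q, "fq": "{!collapse field=%s}" % field}
    return SOLR_SELECT + "?" + urlencode(params)


if __name__ == "__main__":
    print(grouped_query("title:tutorial"))
    print(collapsed_query("title:tutorial"))
```

With grouping, the documents come back nested per group value; with the collapse filter, the response stays a flat document list with one document per distinct title_string value, which is usually easier to consume for plain de-duplication.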
On Tue, 2015-11-03 at 14:53 +0530, vishal raut wrote:
> I have indexed various videos in solr which I have in my database. I want
> to search for those video titles, but there can be duplicate video titles
> as well (If the video is same but source is different, this will have
> separate entry in s
Hello,
This is in reference to the question I asked on the Solr Confluence (I have
copied the conversation at the end of this mail).
I have indexed various videos in Solr which I have in my database. I want
to search for those video titles, but there can be duplicate video titles
as well (If the video is same