Re: Find duplicates

2014-12-02 Thread Alexandre Rafalovitch
And if I am correct, enabling docValues will do this kind of grouping as part of the indexing with docValues data structure (per segment). So, all one has to do is to get it back (through faceting). Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newslett

RE: Find duplicates

2014-12-02 Thread Gonzalo Rodriguez
Have you tried using result grouping for your query? There are some very good examples in the wiki: https://wiki.apache.org/solr/FieldCollapsing Gonzalo -Original Message- From: Peter Kirk [mailto:p...@alpha-solutions.dk] Sent: Tuesday, December 02, 2014 9:58 AM To: solr-user@lucene.a

Re: Find duplicates

2014-12-02 Thread Erik Hatcher
Sort of… if you indexed the full value of the field (and you’re looking for truly exact matches) as a string field type you could facet on that field with facet.mincount=2 and the facets returned would be the ones with duplicate values. You’d have to drill down on each of the facets returned to