Re: Grouping by simhash signature

2015-12-09 Thread Nickolay41189
Maybe there is some way to override equals function of grouping (change "==" to strdist)? -- View this message in context: http://lucene.472066.n3.nabble.com/Grouping-by-simhash-signature-tp4243236p4244541.html Sent from the Solr - User mailing list archive at Nabble.com.

Grouping by simhash signature

2015-12-02 Thread Nickolay41189
I try to implement NearDup detection by SimHash algorithm in Solr. Let's say: 1) each document has a field /simhash_signature/ that stores a sequence of bits. 2) that in order to be considered NearDup, documents must have, at most, 2 bits that

Re: How to get SortedDocValues in lucene-5.2.1

2015-07-11 Thread Nickolay41189
But what about using cache? In my old code I used it. I need the same functionality. -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-get-SortedDocValues-in-lucene-5-2-1-tp4216858p4216869.html Sent from the Solr - User mailing list archive at Nabble.com.

How to get SortedDocValues in lucene-5.2.1

2015-07-11 Thread Nickolay41189
In lucene-4.10.1 for to get *SortedDocValues *I used this code: SolrIndexSearcher searcher = request.getSearcher(); AtomicReader reader = searcher.getAtomicReader(); SortedDocValues sourceIndex = FieldCache.DEFAULT.getTermsIndex(reader, fieldName); It worked fine, but now I don't understand what

Re: Term count in multivalue fields

2014-11-08 Thread Nickolay41189
"while indexing add a field containing the number" isn't suitable for my case. I can't add new field and do indexing. -- View this message in context: http://lucene.472066.n3.nabble.com/Term-count-in-multivalue-fields-tp4168138p4168400.html Sent from the Solr - User mailing list archive at Nabb

Re: Term count in multivalue fields

2014-11-07 Thread Nickolay41189
Andrey, thank you for reply. Can you explain what do you mean "faceting query with prefix"? I'm newer in the wolrd of Solr, can you give me example of this query? -- View this message in context: http://lucene.472066.n3.nabble.com/Term-count-in-multivalue-fields-tp4168138p4168167.html Sent from

Sort documents by exist(multivalued field)

2014-11-07 Thread Nickolay41189
I want to sort by multivalued field like boolean values. Something like that: *sort exist(multivalued field name) desc* Is it possible? P.S. I know that sorting doesn't work for multivalued fields, but it work for single boolean field... -- View this message in context: http://lucene.472066.n

Sort documents by first value in multivalued field

2014-11-07 Thread Nickolay41189
How can I sort documents by first value in multivalued field? (without adding copyField and without some changes in schema.xml)? -- View this message in context: http://lucene.472066.n3.nabble.com/Sort-documents-by-first-value-in-multivalued-field-tp4168140.html Sent from the Solr - User mailin

Term count in multivalue fields

2014-11-07 Thread Nickolay41189
I have multivalue field in my schema.xml: I have indexed the following documents: ... bmp zip bmp ... ... zip zip bmp ... How can I retrieve the count of the term "zip" (in this example it must be 3) from all multivalued field in this index without add