Getting unique key of a document inside of a Similarity class.

J-Pro Thu, 19 Feb 2015 13:36:14 -0800

Good afternoon.

I need to uniquely identify a document inside of a Similarity class during scoring. Is it possible to get the value of a document's unique key at this point?

For some time I thought I could use the internal docID for that. The method score(int doc, float freq) is called after every query execution for each matched doc. For the indexed docs, doc equals 0, 1, 2, etc. But this holds only when the documents are indexed in bulk, i.e. in a single HTTP request. When docs are indexed in separate requests, these docIDs equal 0 for all documents.


To summarize, here are 2 final questions:

1. Is the docID behavior described above a bug or a feature? Obviously, if it's a bug and I can use the docID to uniquely identify a document, then my question is answered once the bug is fixed.

2. If the docID behavior described above is normal, then what is an alternative way of uniquely identifying a document inside of a Similarity class during scoring? Can I get the unique key of the document being scored in Similarity?

FYI: I asked the 1st question in the #solr IRC channel. A person named hoss answered the following: "you're seeing the *internal* docIds ... you can't assign any special meaning to them ... i believe that at the level of the Similarity class, these may even be per segment, which means that in the context of a SegmentReader they can be used to get things like docValues, but they odn't have any meaning compared to your uniqueKey (for example)". This kinda makes me think that the answer to the 1st question is "it's a feature". But I am still not sure, and I don't know the answer to the 2nd question. Please help.
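
To make hoss's point concrete, here is a minimal sketch in plain Java of how per-segment numbering could produce the docIDs I am seeing. It is a toy model, not Lucene code: the class SegmentDocIdDemo, the Segment type and the methods buildIndex, globalDocId and uniqueKey are all hypothetical names, and it assumes that every separate indexing request ended up in its own segment (for example because each one was committed separately and no merge has happened yet).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Toy model (NOT Lucene code): every segment numbers its documents from 0. */
public class SegmentDocIdDemo {

    /** Hypothetical stand-in for one index segment. */
    public static final class Segment {
        public final int docBase;             // global docID of the segment's first document
        public final List<String> uniqueKeys; // uniqueKey value per segment-local docID

        Segment(int docBase, List<String> uniqueKeys) {
            this.docBase = docBase;
            this.uniqueKeys = uniqueKeys;
        }
    }

    /** Assumption: one segment per indexing request, with consecutive docBase values. */
    public static List<Segment> buildIndex(List<List<String>> requests) {
        List<Segment> segments = new ArrayList<>();
        int docBase = 0;
        for (List<String> batch : requests) {
            segments.add(new Segment(docBase, new ArrayList<>(batch)));
            docBase += batch.size();
        }
        return segments;
    }

    /** A segment-local docID only becomes index-wide once docBase is added. */
    public static int globalDocId(Segment segment, int localDocId) {
        return segment.docBase + localDocId;
    }

    /** Looks up the uniqueKey through the segment the local docID belongs to. */
    public static String uniqueKey(Segment segment, int localDocId) {
        return segment.uniqueKeys.get(localDocId);
    }

    public static void main(String[] args) {
        // Three separate requests, one document each: every local docID is 0.
        List<Segment> index = buildIndex(Arrays.asList(
                Arrays.asList("A"), Arrays.asList("B"), Arrays.asList("C")));
        for (Segment s : index) {
            for (int local = 0; local < s.uniqueKeys.size(); local++) {
                System.out.println("local=" + local
                        + " global=" + globalDocId(s, local)
                        + " key=" + uniqueKey(s, local));
            }
        }
    }
}
```

In this model the local docID is 0 for all three documents, which matches what I observe, while indexing the same documents in one request would give 0, 1, 2. If Similarity really works per segment, then the docID alone cannot identify a document; it would have to be combined with something from the segment it belongs to, such as a docBase offset or a per-segment lookup of the uniqueKey field.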


Thank you very much in advance.
