Daniel, You can start from here http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/search/similarities/Similarity.html#coord%28int,%20int%29but it requires deep understanding of Lucene internals
On Tue, Jan 29, 2013 at 2:12 PM, Daniel Rosher <rosh...@gmail.com> wrote: > Hi, > > I'm wondering if there exists or if someone has implemented something like > the following as a function query: > > overlap(query,field) = number of matching terms in field/number of terms in > field > > e.g. with three docs having these tokens(e.g.A B C) in a field > D > 1:A B B > 2:A B > 3:A > > The overlap would be for these queries (-- highlights possibly highest > scoring doc): > > Q:A > 1:1/3 > 2:1/2 > 3:1/1 -- > > Q:A B > 1:2/3 > 2:2/2 -- > 3:1/1 > > Q:A B C > 1:2/3 > 2:2/2 -- > 3:1/1 > > The objective to to pick the most likely doc using the overlap to boost the > score. > > Cheers, > Dan > -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> <mkhlud...@griddynamics.com>