Daniel,

You can start from here:
http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/search/similarities/Similarity.html#coord%28int,%20int%29
but it requires a deep understanding of Lucene internals.
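
To pin down the numbers in the quoted question, here is a minimal sketch of the overlap measure in plain Python. It is not Lucene or Solr code; the function name `overlap` and the `docs` dictionary are only illustrative. It assumes "matching terms" means distinct query terms found in the field and "terms in field" means the total token count, which is the reading that reproduces the figures in the example. Note that coord() itself is based on the fraction of query terms matched rather than on the field length, so it is a starting point, not a ready answer.

```python
from fractions import Fraction


def overlap(query_terms, field_tokens):
    """Distinct query terms present in the field, divided by the field's token count."""
    if not field_tokens:
        return Fraction(0)  # an empty field cannot overlap with anything
    matched = set(query_terms) & set(field_tokens)
    return Fraction(len(matched), len(field_tokens))


# The three example documents from the quoted message.
docs = {
    1: ["A", "B", "B"],
    2: ["A", "B"],
    3: ["A"],
}

if __name__ == "__main__":
    for query in (["A"], ["A", "B"], ["A", "B", "C"]):
        print("Q:" + " ".join(query))
        for doc_id, tokens in docs.items():
            print(f"{doc_id}:{overlap(query, tokens)}")
```

With this definition, docs 2 and 3 tie for the two-term and three-term queries, so the overlap alone does not separate them; it would have to be combined with the regular score.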



On Tue, Jan 29, 2013 at 2:12 PM, Daniel Rosher <rosh...@gmail.com> wrote:

> Hi,
>
> I'm wondering if there exists or if someone has implemented something like
> the following as a function query:
>
> overlap(query,field) = number of matching terms in field/number of terms in
> field
>
> e.g. with three docs having these tokens (e.g. A B C) in a field
> D
> 1:A B B
> 2:A B
> 3:A
>
> The overlap would be for these queries (-- highlights possibly highest
> scoring doc):
>
> Q:A
> 1:1/3
> 2:1/2
> 3:1/1 --
>
> Q:A B
> 1:2/3
> 2:2/2 --
> 3:1/1
>
> Q:A B C
> 1:2/3
> 2:2/2 --
> 3:1/1
>
> The objective is to pick the most likely doc, using the overlap to boost the
> score.
>
> Cheers,
> Dan
>



-- 
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
 <mkhlud...@griddynamics.com>
