[ 
https://issues.apache.org/jira/browse/LUCENE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17200370#comment-17200370
 ] 

Adrien Grand commented on LUCENE-9537:
--------------------------------------

Sorry, this might be a very naive question, but would associating a smoothing 
score to documents when they don't match a term essentially have the same 
effect as subtracting this smoothing score whenever a document contains a term, 
and the absolute scores would be different but the ordering of hits would 
remain the same? I'm asking because the fact that terms that don't match don't 
contribute a score is hardcoded into Lucene so if there were other ways we 
could achieve the same effect, it might be easier to integrate.

> Add Indri Search Engine Functionality to Lucene
> -----------------------------------------------
>
>                 Key: LUCENE-9537
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9537
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/search
>            Reporter: Cameron VandenBerg
>            Priority: Major
>              Labels: patch
>         Attachments: LUCENE-INDRI.patch
>
>
> Indri ([http://lemurproject.org/indri.php]) is an academic search engine 
> developed by The University of Massachusetts and Carnegie Mellon University.  
> The major difference between Lucene and Indri is that Indri will give a 
> document a "smoothing score" to a document that does not contain the search 
> term, which has improved the search ranking accuracy in our experiments.  I 
> have created an Indri patch, which adds the search code needed to implement 
> the Indri AND logic as well as Indri's implementation of Dirichlet Smoothing.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to