[ 
https://issues.apache.org/jira/browse/LUCENE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17217058#comment-17217058
 ] 

Cameron VandenBerg commented on LUCENE-9537:
--------------------------------------------

Hello!

 

I have uploaded a new/different version of the patch for Indri functionality to 
be added to Lucene.  In this newer patch, I have added the smoothingScore 
method to the Scorable interface and then implemented that method in each of 
the scorers.  For everything but the TermScorer, smoothingScore returns 0.  For 
TermScorer, smoothingScore calls the docScorer just like score does, just with 
the term frequency of 0.  Additionally, I added comments to the Indri classes 
to make the logic clear.

 

This implementation allows the Indri logic to be more tightly coupled with 
Lucene so that there is less forking of the Lucene classes.   While I did need 
to change a number of Lucene classes, the addition of the smoothingScore method 
will not change anything else in the Lucene code.

 

Could you let me know which version of the patch would work better with Lucene? 
 Also, I am very open to suggestions if there is something you think would work 
better.  I would really like to be able to move forward with being able to add 
the work to Lucene.

 

Thank You!

> Add Indri Search Engine Functionality to Lucene
> -----------------------------------------------
>
>                 Key: LUCENE-9537
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9537
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/search
>            Reporter: Cameron VandenBerg
>            Priority: Major
>              Labels: patch
>         Attachments: LUCENE-9537.patch, LUCENE-INDRI.patch
>
>
> Indri ([http://lemurproject.org/indri.php]) is an academic search engine 
> developed by The University of Massachusetts and Carnegie Mellon University.  
> The major difference between Lucene and Indri is that Indri will give a 
> document a "smoothing score" to a document that does not contain the search 
> term, which has improved the search ranking accuracy in our experiments.  I 
> have created an Indri patch, which adds the search code needed to implement 
> the Indri AND logic as well as Indri's implementation of Dirichlet Smoothing.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to