[ 
https://issues.apache.org/jira/browse/LUCENE-9365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17101829#comment-17101829
 ] 

Mark Harwood commented on LUCENE-9365:
--------------------------------------

WildcardQuery and RegexpQuery are both awkward solutions to this problem: each 
comes with its own syntax, so any of its special characters that appear in the 
prefix string would have to be escaped.
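
To illustrate the escaping burden, here is a minimal sketch in plain Java. It 
assumes the WildcardQuery syntax in which `*` and `?` are operators and `\` is 
the escape character. The class and method names (WildcardEscapeSketch, 
escapeWildcard, patternsFor) are hypothetical helpers, not Lucene API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: shows what a caller would have to
// do before embedding a literal prefix in a wildcard pattern.
final class WildcardEscapeSketch {

    // Escape the characters that wildcard syntax treats specially
    // (assumed here to be '*', '?' and the escape character '\').
    static String escapeWildcard(String literal) {
        StringBuilder sb = new StringBuilder(literal.length() + 4);
        for (int i = 0; i < literal.length(); i++) {
            char c = literal.charAt(i);
            if (c == '*' || c == '?' || c == '\\') {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    // '?' matches exactly one character, so covering "up to maxExtra
    // characters after the prefix" needs one pattern per suffix length.
    static List<String> patternsFor(String prefix, int maxExtra) {
        List<String> patterns = new ArrayList<>();
        StringBuilder pattern = new StringBuilder(escapeWildcard(prefix));
        patterns.add(pattern.toString());
        for (int i = 0; i < maxExtra; i++) {
            pattern.append('?');
            patterns.add(pattern.toString());
        }
        return patterns;
    }
}
```

Because `?` stands for exactly one character, a single pattern is not enough to 
cover every suffix length up to the limit, which adds to the awkwardness.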

Perhaps another way of addressing the problem is to enhance PrefixQuery with 
an option that limits the number of characters allowed after the prefix, 
rather than accepting an unlimited number as it does today.
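
The matching rule such an option would imply can be sketched in plain Java. 
This only illustrates the intended semantics on individual terms; it is not how 
PrefixQuery is implemented, and the names BoundedPrefixSketch, matches and 
maxExtraChars are assumptions made for this example.

```java
// Hypothetical illustration of "prefix plus at most N more characters".
// It does not use or modify any Lucene class.
final class BoundedPrefixSketch {

    // Returns true when the term starts with the prefix and is followed by
    // at most maxExtraChars further characters (counted as code points).
    static boolean matches(String term, String prefix, int maxExtraChars) {
        if (maxExtraChars < 0) {
            throw new IllegalArgumentException("maxExtraChars must be >= 0");
        }
        if (!term.startsWith(prefix)) {
            return false;
        }
        int extra = term.codePointCount(prefix.length(), term.length());
        return extra <= maxExtraChars;
    }
}
```

With a limit of 1, the search string `bba` would accept both `bba` and `bbab` 
but reject `bbabc`, and no escaping of the prefix is needed.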

> Fuzzy query has a false negative when prefix length == search term length 
> --------------------------------------------------------------------------
>
>                 Key: LUCENE-9365
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9365
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/query/scoring
>            Reporter: Mark Harwood
>            Priority: Major
>
> When using FuzzyQuery the search string `bba` does not match doc value `bbab` 
> with an edit distance of 1 and prefix length of 3.
> In FuzzyQuery an automaton is created for the "suffix" part of the search 
> string which in this case is an empty string.
> In this scenario maybe the FuzzyQuery should rewrite to a WildcardQuery of 
> the following form :
> {code:java}
>     searchString + "?" 
> {code}
> .. where there's an appropriate number of ? characters according to the edit 
> distance.


