[ 
https://issues.apache.org/jira/browse/LUCENE-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17314008#comment-17314008
 ] 

Julie Tibshirani commented on LUCENE-9657:
------------------------------------------

Closing this out for now, feel free to follow-up in Elasticsearch!

> Unified Highlighter throws too_complex_to_determinize_exception with >288 
> filter terms
> --------------------------------------------------------------------------------------
>
>                 Key: LUCENE-9657
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9657
>             Project: Lucene - Core
>          Issue Type: Bug
>    Affects Versions: 8.6.2
>            Reporter: Isaac Doub
>            Priority: Major
>
> There seems to be a problem with the Unified Highlighter in Lucene 8.6.2 that 
> is affecting ElasticSearch 7.9.1. If a search is performed with >288 filter 
> terms using the unified highlighter it throws a 
> too_complex_to_determinize_exception error, but if you switch to the plain 
> highlighter it works fine. Alternatively, if you filter on a "copy_to" field 
> instead of the indexed field, it also works.
>  
> This throws the error
> {code:java}
>  {
>     "highlight": {
>         "type": "unified",
>         "fields": {
>             "title": {
>                 "require_field_match": false
>             }
>         }
>     },
>     "query": {
>         "bool": {
>             "must": [{
>                 "query_string": {
>                     "query": "*"
>                 }
>             }],
>             "filter": [{
>                 "bool": {
>                     "must": [{
>                         "terms": {
>                             "id": [ ">288 terms here" ]
>                         }
>                     }]
>                 }
>             }]
>         }
>     }
> }{code}
>  
>  
> But this works fine
> {code:java}
>  {
>     "highlight": {
>         "type": "plain",
>         "fields": {
>             "title": {
>                 "require_field_match": false
>             }
>         }
>     },
>     "query": {
>         "bool": {
>             "must": [{
>                 "query_string": {
>                     "query": "*"
>                 }
>             }],
>             "filter": [{
>                 "bool": {
>                     "must": [{
>                         "terms": {
>                             "id": [ ">288 terms here" ]
>                         }
>                     }]
>                 }
>             }]
>         }
>     }
> }{code}
>  
>  
> Or if I adjust the search to use the copy_to field it works as well (note 
> "id" is now "_id")
> {code:java}
>  {
>     "highlight": {
>         "type": "unified",
>         "fields": {
>             "title": {
>                 "require_field_match": false
>             }
>         }
>     },
>     "query": {
>         "bool": {
>             "must": [{
>                 "query_string": {
>                     "query": "*"
>                 }
>             }],
>             "filter": [{
>                 "bool": {
>                     "must": [{
>                         "terms": {
>                             "_id": [ ">288 terms here" ]
>                         }
>                     }]
>                 }
>             }]
>         }
>     }
> }{code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to