[
https://issues.apache.org/jira/browse/LUCENE-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17314008#comment-17314008
]
Julie Tibshirani commented on LUCENE-9657:
------------------------------------------
Closing this out for now. Feel free to follow up in Elasticsearch!
> Unified Highlighter throws too_complex_to_determinize_exception with >288
> filter terms
> --------------------------------------------------------------------------------------
>
> Key: LUCENE-9657
> URL: https://issues.apache.org/jira/browse/LUCENE-9657
> Project: Lucene - Core
> Issue Type: Bug
> Affects Versions: 8.6.2
> Reporter: Isaac Doub
> Priority: Major
>
> There seems to be a problem with the Unified Highlighter in Lucene 8.6.2 that
> is affecting Elasticsearch 7.9.1. If a search is performed with >288 filter
> terms using the unified highlighter, it throws a
> too_complex_to_determinize_exception error, but if you switch to the plain
> highlighter, it works fine. Alternatively, if you filter on a "copy_to" field
> instead of the indexed field, it also works.
>
> This throws the error
> {code:java}
> {
>   "highlight": {
>     "type": "unified",
>     "fields": {
>       "title": {
>         "require_field_match": false
>       }
>     }
>   },
>   "query": {
>     "bool": {
>       "must": [{
>         "query_string": {
>           "query": "*"
>         }
>       }],
>       "filter": [{
>         "bool": {
>           "must": [{
>             "terms": {
>               "id": [ ">288 terms here" ]
>             }
>           }]
>         }
>       }]
>     }
>   }
> }
> {code}
>
>
> But this works fine
> {code:java}
> {
>   "highlight": {
>     "type": "plain",
>     "fields": {
>       "title": {
>         "require_field_match": false
>       }
>     }
>   },
>   "query": {
>     "bool": {
>       "must": [{
>         "query_string": {
>           "query": "*"
>         }
>       }],
>       "filter": [{
>         "bool": {
>           "must": [{
>             "terms": {
>               "id": [ ">288 terms here" ]
>             }
>           }]
>         }
>       }]
>     }
>   }
> }
> {code}
>
>
> Or if I adjust the search to use the copy_to field it works as well (note
> "id" is now "_id")
> {code:java}
> {
>   "highlight": {
>     "type": "unified",
>     "fields": {
>       "title": {
>         "require_field_match": false
>       }
>     }
>   },
>   "query": {
>     "bool": {
>       "must": [{
>         "query_string": {
>           "query": "*"
>         }
>       }],
>       "filter": [{
>         "bool": {
>           "must": [{
>             "terms": {
>               "_id": [ ">288 terms here" ]
>             }
>           }]
>         }
>       }]
>     }
>   }
> }
> {code}
>
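
For anyone following up, the three request bodies quoted above differ only in the highlighter type and the field used in the terms filter. A minimal sketch of generating them with a concrete number of terms might look like the following. The function name, its parameters, and the "doc-N" placeholder IDs are assumptions made for illustration and do not come from the report; whether a particular term count reproduces the error probably also depends on the term values themselves.

```python
import json


def build_request(term_count, highlighter="unified", id_field="id"):
    """Build a search body shaped like the requests quoted in the report.

    term_count, highlighter, id_field and the "doc-N" placeholder IDs are
    illustrative assumptions, not values taken from the original report.
    """
    # Placeholder terms standing in for the real document IDs.
    terms = [f"doc-{i}" for i in range(term_count)]
    return {
        "highlight": {
            "type": highlighter,
            "fields": {"title": {"require_field_match": False}},
        },
        "query": {
            "bool": {
                "must": [{"query_string": {"query": "*"}}],
                "filter": [{"bool": {"must": [{"terms": {id_field: terms}}]}}],
            }
        },
    }


# The three variants from the report, each with 289 placeholder terms.
unified_body = build_request(289)                    # reported to fail
plain_body = build_request(289, highlighter="plain")  # reported to work
by_id_body = build_request(289, id_field="_id")       # reported to work

# Serialize one of them to JSON for use as a search request body.
payload = json.dumps(unified_body, indent=2)
```

Each returned dict can be serialized with json.dumps and sent as the body of a search request, which makes it easier to vary the term count when narrowing down where the exception starts to appear.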