gaobinlong opened a new pull request, #15167:
URL: https://github.com/apache/lucene/pull/15167
### Description
Resolve a TODO in FirstPassGroupingCollector, introduce a new parameter
`ignoreDocsWithoutGroupField`, if it's true, ignore documents that don't have
the group field instead of putting them into a null group.
This change can also improve the field collapsing performance, I did some
benchmark test against OpenSearch, here are the result:
```
Query DSL:
GET big5/_search?size=10
{
"query": {
"match": {
"message": "50-136-239-27"
}
},
"collapse": {
"field": "count"
}
}
Before:
| 50th percentile service time |
collapsing | 217.497 | ms |
| 90th percentile service time |
collapsing | 221.677 | ms |
| 99th percentile service time |
collapsing | 236.481 | ms |
| 99.9th percentile service time |
collapsing | 266.285 | ms |
| 100th percentile service time |
collapsing | 271.294 | ms |
After:
| 50th percentile service time |
collapsing | 180.974 | ms |
| 90th percentile service time |
collapsing | 185.945 | ms |
| 99th percentile service time |
collapsing | 205.012 | ms |
| 99.9th percentile service time |
collapsing | 222.983 | ms |
| 100th percentile service time |
collapsing | 259.105 | ms |
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]