Jackie-Jiang commented on PR #8818: URL: https://github.com/apache/pinot/pull/8818#issuecomment-1154271827
> Thanks @atris for taking a look. I also have a question around how Lucene and native FST handle the query, specifically for these results: ``` SELECT INT_COL FROM MyTable WHERE regexp_like(DOMAIN_NAMES, '(?:^.*domain9.*$)|...|(?:^.*domain1.*$)') BenchmarkFuseRegexp.decreasing9Fusing LUCENE avgt 5 24.198 ± 13.439 ms/op BenchmarkFuseRegexp.decreasing9Fusing NATIVE avgt 5 0.263 ± 0.019 ms/op BenchmarkFuseRegexp.decreasing9Fusing null avgt 5 0.476 ± 0.040 ms/op SELECT INT_COL FROM MyTable WHERE regexp_like(DOMAIN_NAMES, '(?:^.*domain0.*$)|...|(?:^.*domain9.*$)') BenchmarkFuseRegexp.increasing10Fusing LUCENE avgt 5 ERROR BenchmarkFuseRegexp.increasing10Fusing NATIVE avgt 5 0.326 ± 0.138 ms/op BenchmarkFuseRegexp.increasing10Fusing null avgt 5 0.226 ± 0.062 ms/op ``` We can see that Lucene FST is very slow in this case, but native is still performing okay. Trying to understand what is causing this difference. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org