mikemccand commented on issue #12696: URL: https://github.com/apache/lucene/issues/12696#issuecomment-1775638986
> Are there any additional corpora that we should also test this with?

Maybe the NYC taxis? It is a sparser corpus with tiny docs (vs. the dense, medium/large docs in `enwiki`). The tooling for indexing the NYC taxis corpus is already in `luceneutil` (it runs nightly: https://home.apache.org/~mikemccand/lucenebench/sparseResults.html). This is a nice counterpoint to `enwiki`.

> Would this be a potential change for Lucene 9.9 or perhaps 10.0?

That's a good question. It is a very low-level index format change, with no API change. It would be fully back-compat whether we release it in 9.9 or 10.0. I don't see why we should withhold the change until 10.0, so maybe 9.9?