To comment further, I think it would be good if we had query benchmarks that tried to map directly to the queries benchmarked for lucene, against the same data. This would give us a notion of which half of the equation any slow down comes from (or speed up!)
On Fri, Jun 28, 2024 at 2:42 PM Gus Heck <gus.h...@gmail.com> wrote: > Yes. In fact I have an example in JesterJ of indexing the luceneutil > data... (but it's still in early stages, I think it's still against > _default schema, perhaps... (need to look again, haven't had time to work > on it recently) > > https://github.com/nsoft/jesterj/tree/master/code/examples/wikidocs > > On Fri, Jun 28, 2024 at 11:33 AM David Smiley <dsmi...@apache.org> wrote: > >> I was thinking of using Solr's "benchmark" module/thing to benchmark >> parallel segment search (coming to Solr 9.7 but needs more love). I >> don't notice any substantial data to query for in this module, >> however. Has anyone considered adding wikipedia, like how Lucene's >> "luceneutil" does? Or something else? Is this a bad idea for this >> benchmark module or should I be looking elsewhere like Searchscale's >> solr-bench[1]? >> >> https://github.com/searchscale/solr-bench >> >> ~ David Smiley >> Apache Lucene/Solr Search Developer >> http://www.linkedin.com/in/davidwsmiley >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: dev-unsubscr...@solr.apache.org >> For additional commands, e-mail: dev-h...@solr.apache.org >> >> > > -- > http://www.needhamsoftware.com (work) > https://a.co/d/b2sZLD9 (my fantasy fiction book) > -- http://www.needhamsoftware.com (work) https://a.co/d/b2sZLD9 (my fantasy fiction book)