mikemccand opened a new issue, #13503:
URL: https://github.com/apache/lucene/issues/13503

   ### Description
   
   Spinoff from the exciting discussion on 
https://github.com/apache/lucene/pull/13472:
   
   Lucene has made great gains recently on intra-query concurrency: using 
multiple threads (with a "slice" work unit = one or more segments) to reduce 
latency of queries.  Besides faster wall clock time, since CPU is running 
concurrently, Lucene also can gain efficiency because segments can terminate 
earlier / start using skipping as the more competitive results from other 
segments arrive sooner/concurrently, causing less total CPU to be spent to get 
the top hits for the query.
   
   But I think Lucene's doc values and taxonomy facets do not use any 
concurrency?  Even if you pass a `TaskExecutor` to `IndexSearcher`, facet 
counting will still run single threaded.  Can we fix this to also make facet 
counting faster (net elapsed wall clock time)?  It's tricky because some facet 
counting aggregate into data structures (like `int[]` or an HPPC int->int map) 
that are not easily made thread safe?
   
   Note: we do have `ConcurrentSortedSetDocValuesFacetCounts` which does use 
concurrency, but the other facet counting (numeric ranges, taxonomy facets) do 
not.  Also, `ConcurrentSortedSetDocValuesFacetCounts` takes its own 
`ExecutorService` not a `TaskExecutor`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to