benwtrent commented on PR #14226:
URL: https://github.com/apache/lucene/pull/14226#issuecomment-2657348683

   OK, I ran it on 8M data set with 128 segments. 
   
   Indeed it visits way fewer vectors (seemingly), and is consistent across 
multiple threads. 
   
   ```
   recall  latency(ms)     nDoc  topK  fanout  visited  num segments  
selectivity
    0.719       36.100  8000000   100     100    28015           128        
1.000
   ```
   
   8 threaded qps
   ```
   recall  latency(ms)     nDoc  topK  fanout  visited  num segments  
selectivity
    0.719        5.700  8000000   100     100    28015           128        
1.000
   ```
   
   The current baseline (which isn't multi-threaded consistent, so I am just 
showing the single thread perf):
   
   ```
   recall  latency(ms)     nDoc  topK  fanout  visited  num segments  
selectivity
    0.949       59.900  8000000   100     100    87299           128        
1.000
   ```
   
   Given the visited & latency, I wonder if the "visited" actually represents 
what this query is visiting?
   
   But, maybe the answer is allowing more exploration per segment?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to