luyuncheng commented on PR #11987:
URL: https://github.com/apache/lucene/pull/11987#issuecomment-1333730146

   > We could run the StoredFieldsBenchmark before and after the change with -Dorg.apache.lucene.codecs.lucene90.compressing.Lucene90CompressingStoredFieldsWriter.enableBulkMerge=false to force the slow merge path.
   > In all cases when running the benchmark, we may want to explicitly supply smaller heap (-Xmx),
   
   @rmuir I just modified https://github.com/mikemccand/luceneutil/blob/master/src/python/runStoredFieldsBenchmark.py#L43 to:
   `command = f'{localconstants.JAVA_EXE} -Xmx256m -Dorg.apache.lucene.codecs.lucene90.compressing.Lucene90CompressingStoredFieldsWriter.enableBulkMerge=false -cp {lucene_core_jar}:build perf.StoredFieldsBenchmark {geonames_csv_in} {localconstants.INDEX_DIR_BASE}/geonames-stored-fields {mode} {doc_limit}'`
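
For repeating the run with different heap sizes, the command line can be built from parameters instead of being edited by hand. This is only a sketch: `build_command` and its arguments are hypothetical names, not part of luceneutil, and the example paths are placeholders. The system property name and the benchmark arguments are the ones from the command above.

```python
# System property from the command above; setting it to false forces the slow merge path.
BULK_MERGE_PROP = (
    "org.apache.lucene.codecs.lucene90.compressing."
    "Lucene90CompressingStoredFieldsWriter.enableBulkMerge"
)


def build_command(java_exe, lucene_core_jar, geonames_csv_in, index_dir_base,
                  mode, doc_limit, heap=None, enable_bulk_merge=True):
    """Hypothetical helper: assemble the StoredFieldsBenchmark command line."""
    jvm_args = []
    if heap:
        # e.g. heap="256m" becomes -Xmx256m
        jvm_args.append(f"-Xmx{heap}")
    if not enable_bulk_merge:
        jvm_args.append(f"-D{BULK_MERGE_PROP}=false")
    parts = [
        java_exe,
        *jvm_args,
        "-cp", f"{lucene_core_jar}:build",
        "perf.StoredFieldsBenchmark",
        geonames_csv_in,
        f"{index_dir_base}/geonames-stored-fields",
        mode,
        str(doc_limit),
    ]
    return " ".join(parts)


# Placeholder values for illustration only.
example = build_command(
    java_exe="java",
    lucene_core_jar="lucene-core.jar",
    geonames_csv_in="allCountries.txt",
    index_dir_base="/tmp/indices",
    mode="BEST_SPEED",
    doc_limit=1000,
    heap="256m",
    enable_bulk_merge=False,
)
print(example)
```

Leaving `heap` unset omits `-Xmx` entirely, so the JVM default applies.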
   
   I ran runStoredFieldsBenchmark four times. As the following tables show, there is only a small performance regression:
   
   runStoredFieldsBenchmark.py __enableBulkMerge=false__
   |                     | Baseline  | Candidate  |
   | :---                |  :----:   |    ---:    |
   | indexing_time_msec  |           |            |
   | BEST_SPEED          | 365665.00 | 372287.00  |
   | BEST_COMPRESSION    | 849157.00 | 848813.00  |
   | retrieved_time_msec |           |            |
   | BEST_SPEED          |  246.62   |  269.32    |
   | BEST_COMPRESSION    | 2606.98   | 2634.53    |
   
   runStoredFieldsBenchmark.py __enableBulkMerge=false -Xmx1g__
   |                     | Baseline  | Candidate  |
   | :---                |  :----:   |    ---:    |
   | indexing_time_msec  |           |            |
   | BEST_SPEED          | 372457.00 | 366094.00  |
   | BEST_COMPRESSION    | 850273.00 | 852397.00  |
   | retrieved_time_msec |           |            |
   | BEST_SPEED          |  247.70   |  279.11    | 
   | BEST_COMPRESSION    | 2585.59   | 2633.83    | 
   
   runStoredFieldsBenchmark.py __enableBulkMerge=false -Xmx512m__
   |                     | Baseline  | Candidate  |
   | :---                |  :----:   |    ---:    |
   | indexing_time_msec  |           |            |
   | BEST_SPEED          | 368389.00 | 370878.00  |
   | BEST_COMPRESSION    | 851277.00 | 850121.00  |
   | retrieved_time_msec |           |            |
   | BEST_SPEED          |  256.80   |  280.52    | 
   | BEST_COMPRESSION    | 2576.36   | 2645.32    | 
   
   runStoredFieldsBenchmark.py __enableBulkMerge=false -Xmx256m__
   |                     | Baseline  | Candidate  |
   | :---                |  :----:   |    ---:    |
   | indexing_time_msec  |           |            |
   | BEST_SPEED          | 366735.00 | 368407.00  |
   | BEST_COMPRESSION    | 849980.00 | 852214.00  |
   | retrieved_time_msec |           |            |
   | BEST_SPEED          |  256.10   |  278.06    | 
   | BEST_COMPRESSION    | 2584.96   | 2632.69    | 
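
To make the size of the differences easier to read, the relative change from baseline to candidate can be computed directly from the numbers in the tables above. This is a small sketch, not part of the benchmark script. The `results` layout and the `pct_change` helper are illustrative names, and the run labels are the table titles.

```python
# (baseline, candidate) pairs in msec, copied from the four tables above.
results = {
    "enableBulkMerge=false": {
        "indexing": {"BEST_SPEED": (365665.00, 372287.00), "BEST_COMPRESSION": (849157.00, 848813.00)},
        "retrieved": {"BEST_SPEED": (246.62, 269.32), "BEST_COMPRESSION": (2606.98, 2634.53)},
    },
    "enableBulkMerge=false -Xmx1g": {
        "indexing": {"BEST_SPEED": (372457.00, 366094.00), "BEST_COMPRESSION": (850273.00, 852397.00)},
        "retrieved": {"BEST_SPEED": (247.70, 279.11), "BEST_COMPRESSION": (2585.59, 2633.83)},
    },
    "enableBulkMerge=false -Xmx512m": {
        "indexing": {"BEST_SPEED": (368389.00, 370878.00), "BEST_COMPRESSION": (851277.00, 850121.00)},
        "retrieved": {"BEST_SPEED": (256.80, 280.52), "BEST_COMPRESSION": (2576.36, 2645.32)},
    },
    "enableBulkMerge=false -Xmx256m": {
        "indexing": {"BEST_SPEED": (366735.00, 368407.00), "BEST_COMPRESSION": (849980.00, 852214.00)},
        "retrieved": {"BEST_SPEED": (256.10, 278.06), "BEST_COMPRESSION": (2584.96, 2632.69)},
    },
}


def pct_change(baseline, candidate):
    """Relative change in percent; positive means the candidate is slower."""
    return (candidate - baseline) / baseline * 100.0


for run, metrics in results.items():
    print(run)
    for metric, modes in metrics.items():
        for mode, (baseline, candidate) in modes.items():
            print(f"  {metric:<10} {mode:<17} {pct_change(baseline, candidate):+.2f}%")
```

Each run here is a single measurement, so differences of a percent or two may be within run-to-run noise.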


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

