mqliang edited a comment on pull request #6710:
URL: https://github.com/apache/incubator-pinot/pull/6710#issuecomment-809076119


   @Jackie-Jiang @siddharthteotia @mcvsubbu Comment addressed and PR is ready 
for review now. I split the change into 5 commits:
   * 1st commit:
      * Rename TrailerKeys as MetadataKeys
      * Associate ID/Name with MetadataKeys
      * V2->V3 convert instead of construct V3 from V2 bytes
      * ASCII layout of V3 datatable
      * Address a TODO in DataTableBuilder: store bytes data into variable size 
data section, instead of String
     
   * 2nd commit: Address a TODO in DataTableBuilder: fix float size issue at 
DataTableBuilder
   * 3rd commit:  Address a TODO in DataTableBuilder: use one Map to map a 
String to Integer for all columns in V3.
   * 4th commit:  fix a bug at BrokerReduceService, which bring down 
integration test.
   * 5th commit:  Log `responseSerializationCpuTimeNs` at QueryScheduler and 
emit a broker gauge; put "executionThreadCpuTimeNs" and 
"responseSerializationCpuTimeNs" into metadata so that they can be sent to 
broker
   
   
   There is still one more TODO in DataTableBuilder: Given a data schema, write 
all values one by one instead of using rowId and colId to position (save time). 
It will not impact the serialized bytes layout of data table, it's just some 
implementation optimization. Which means it does not require a version bumping 
up, so can be done in a separate PR, I create a issue: 
https://github.com/apache/incubator-pinot/issues/6720 to track this. And a 
preliminary benchmark shows that the optimization is quite speculative -- there 
is no improvement by writing all values one by one without using rowId and 
colId to position, for more details, see the benchmark result at: 
https://github.com/apache/incubator-pinot/issues/6720
   
   There is one more thing need to be done: change the interface of 
`DataTable.getMetadata()` returns a `Map<MetadataKeys, String>`, instead of 
`Map<String, String>`. This PR is already quite large,  I wanner address it in 
a separate PR.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to