[ https://issues.apache.org/jira/browse/HBASE-29398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17977647#comment-17977647 ]
Sanjeet Malhotra commented on HBASE-29398: ------------------------------------------ [~vjasani] [~apurtell] any inputs on above proposal? Thanks > Server side scan metrics for bytes read from FS vs Block cache vs memstore > -------------------------------------------------------------------------- > > Key: HBASE-29398 > URL: https://issues.apache.org/jira/browse/HBASE-29398 > Project: HBase > Issue Type: Improvement > Reporter: Sanjeet Malhotra > Assignee: Sanjeet Malhotra > Priority: Major > > Currently, HBase doesn't have a metric on the server side which counts how > many bytes were read from FS vs block cache vs memstore. Reading cells from > in-memory like block cache or memstore vs from FS can make latencies vary > drastically. > Separate metrics for bytes scanned from block cache vs memstore are > beneficial for use cases which immediately read (like within 5 sec) after > writing the data. There the expectation would be that bytes scanned from FS > or block cache should be zero unless a flush happened (which can be checked > from logs). > Currently, HBase has a server side scan metric `countOfBlockBytesScanned` > which aims to capture the block bytes scanned by read request. But there are > few gaps in the metric: > * It doesn't account for block bytes scanned as part of > KeyValueHeap#pollRealKV(). > * It doesn't account for the bytes index block bytes scanned, bloom filter > bytes scanned. > * It doesn't differentiate between bytes scanned from block cache vs FS. > The proposal is to add 3 new server side scan metrics, one each for: bytes > scanned from FS, bytes scanned from block cache and bytes scanned from > memstore. > > Currently, the aim is to just add these 3 new set of metrics and expose them > via ServerSide scan metrics. Replacing `countOfBlockBytesScanned` by bytes > scanned from FS and bytes scanned from block cache and integrating the new > metrics with HBase Quotas code can be taken up separately. > > I intend to cherry-pick this change to HBase 3 and HBase 2 (till HBase 2.5). -- This message was sent by Atlassian Jira (v8.20.10#820010)