[
https://issues.apache.org/jira/browse/HBASE-29569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wellington Chevreuil updated HBASE-29569:
-----------------------------------------
Release Note:
This introduces a regex based built-in tiering value provider for parsing dates
directly from the rowkey portion of Cells. enabling time-based priority
comparison. The implementation utilizes a configurable regular expression and
date format to isolate a date substring within the RowKey, returning the parsed
epoch time in milliseconds.
Usage:
This feature is controlled via table-level configuration properties. To utilize
this provider, specify the class in the TIERING_VALUE_PROVIDER property and
define the
Configuration properties :
TIERING_KEY_DATE_PATTERN: Regex pattern for date extraction.
TIERING_KEY_DATE_FORMAT: Expected date format.
TIERING_KEY_DATE_GROUP: The target regex extract group.
Example using hbase shell:
-----
hbase(main):003:0> alter 'orders', {NAME => 'cf1',
CONFIGURATION => {'hbase.hstore.datatiering.type' => 'CUSTOM',
'TIERING_KEY_DATE_PATTERN' => '_(\\d{4}-\\d{2}-\\d{2})_',
'TIERING_KEY_DATE_FORMAT' => 'yyyy-MM-dd',
'TIERING_KEY_DATE_GROUP' => '1',
'hbase.hstore.datatiering.hot.age.millis' => '604800000',
'hbase.hstore.engine.class' =>
'org.apache.hadoop.hbase.regionserver.CustomTieredStoreEngine',
'hbase.hstore.compaction.date.tiered.custom.age.limit.millis' => '604800000'
}
}
-----
Note: Please ensure strict validation of regex patterns and date formats to
ensure accurate parsing.
was:
This introduces a regex based built-in tiering value provider for parsing dates
directly from the rowkey portion of Cells. enabling time-based priority
comparison. The implementation utilizes a configurable regular expression and
date format to isolate a date substring within the RowKey, returning the parsed
epoch time in milliseconds.
Usage:
This feature is controlled via table-level configuration properties. To utilize
this provider, specify the class in the TIERING_VALUE_PROVIDER property and
define the
Configuration properties :
TIERING_KEY_DATE_PATTERN: Regex pattern for date extraction.
TIERING_KEY_DATE_FORMAT: Expected date format.
TIERING_KEY_DATE_GROUP: The target regex extract group.
Example using hbase shell:
{noformat}
{noformat}
Note: Please ensure strict validation of regex patterns and date formats to
ensure accurate parsing.
> Implement a built-in TieringValueProvider for parsing the date value from the
> rowkey
> ------------------------------------------------------------------------------------
>
> Key: HBASE-29569
> URL: https://issues.apache.org/jira/browse/HBASE-29569
> Project: HBase
> Issue Type: Sub-task
> Reporter: Wellington Chevreuil
> Assignee: Dev Hingu
> Priority: Minor
> Labels: pull-request-available
> Fix For: 3.0.0, 2.7.0, 2.6.5
>
>
> We could implement a built-in TieringValueProvider that loads a regex through
> configuration and uses this to parse the date value from the rowkey, and use
> this for the time based priority comparison.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)