This is an automated email from the ASF dual-hosted git repository. jakevin pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/doris.git
The following commit(s) were added to refs/heads/master by this push:
     new c885913efe Fixed incorrect description of column length limit in partition bucketing documentation. (#12154)
c885913efe is described below

commit c885913efecafe4fc07b3cb4c8e126415b514004
Author: FreeOnePlus <54164178+freeonep...@users.noreply.github.com>
AuthorDate: Tue Aug 30 00:00:51 2022 +0800

    Fixed incorrect description of column length limit in partition bucketing documentation. (#12154)

    Signed-off-by: manyi <f...@freeoneplus.com>
    Signed-off-by: manyi <f...@freeoneplus.com>
    Co-authored-by: manyi <f...@freeoneplus.com>
---
 docs/en/docs/data-table/data-partition.md    | 19 +++++++++----------
 docs/zh-CN/docs/data-table/data-partition.md |  1 -
 2 files changed, 9 insertions(+), 11 deletions(-)

diff --git a/docs/en/docs/data-table/data-partition.md b/docs/en/docs/data-table/data-partition.md
index 441e43cee8..31f45f6a05 100644
--- a/docs/en/docs/data-table/data-partition.md
+++ b/docs/en/docs/data-table/data-partition.md
@@ -134,7 +134,6 @@ When defining columns, you can refer to the following suggestions:
 2. Try to choose the type of integer. Because integer type calculations and lookups are much more efficient than strings.
 3. For the selection principle of integer types of different lengths, follow **enough to**.
 4. For lengths of type VARCHAR and STRING, follow **enough to**.
-5. The total byte length of all columns (including Key and Value) cannot exceed 100KB.
 
 ### Partitioning and Bucket
 
@@ -224,18 +223,18 @@ It is also possible to use only one layer of partitioning. When using a layer pa
         PARTITION `p201702_2000` VALUES LESS THAN ("2017-03-01", "2000"),
         PARTITION `p201703_all` VALUES LESS THAN ("2017-04-01")
     )
-    ```
-
+    ```
+
     In the above example, we specify `date` (DATE type) and `id` (INT type) as partition columns.
     The resulting partitions in the above example are as follows:
-
+
     ```
     *p201701_1000: [(MIN_VALUE, MIN_VALUE), ("2017-02-01", "1000") )
     *p201702_2000: [("2017-02-01", "1000"), ("2017-03-01", "2000") )
     *p201703_all: [("2017-03-01", "2000"), ("2017-04-01", MIN_VALUE))
     ```
-
+
     Note that the last partition user defaults only the partition value of the `date` column, so the partition value of the `id` column will be filled with `MIN_VALUE` by default. When the user inserts data, the partition column values are compared in order, and the corresponding partition is finally obtained. Examples are as follows:
-
+
     ```
     * Data --> Partition
     * 2017-01-01, 200 --> p201701_1000
@@ -290,17 +289,17 @@ It is also possible to use only one layer of partitioning. When using a layer pa
         PARTITION `p3_city` VALUES IN (("3", "Beijing"), ("3", "Shanghai"))
     )
     ```
-
+
     In the above example, we specify `id`(INT type) and `city`(VARCHAR type) as partition columns. The above example ends up with the following partitions.
-
+
     ```
     * p1_city: [("1", "Beijing"), ("1", "Shanghai")]
     * p2_city: [("2", "Beijing"), ("2", "Shanghai")]
     * p3_city: [("3", "Beijing"), ("3", "Shanghai")]
     ```
-
+
     When the user inserts data, the partition column values will be compared sequentially in order to finally get the corresponding partition. An example is as follows.
-
+
     ```
     * Data ---> Partition
     * 1, Beijing ---> p1_city
diff --git a/docs/zh-CN/docs/data-table/data-partition.md b/docs/zh-CN/docs/data-table/data-partition.md
index 1e3be4b31b..b186f51c2b 100644
--- a/docs/zh-CN/docs/data-table/data-partition.md
+++ b/docs/zh-CN/docs/data-table/data-partition.md
@@ -138,7 +138,6 @@ AGGREGATE KEY 数据模型中,所有没有指定聚合方式(SUM、REPLACE
 2. 尽量选择整型类型。因为整型类型的计算和查找效率远高于字符串。
 3. 对于不同长度的整型类型的选择原则,遵循 **够用即可**。
 4. 对于 VARCHAR 和 STRING 类型的长度,遵循 **够用即可**。
-5. 
所有列的总字节长度(包括 Key 和 Value)不能超过 100KB。
 
 ### 分区和分桶
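
For readers of this diff, the multi-column range partition rule quoted in the English hunk ("the partition column values are compared in order") can be illustrated with a small sketch. This is not Doris code: `MIN_ID`, `PARTITIONS`, and `find_partition` are hypothetical names introduced here, and the sketch assumes the comparison behaves like ordinary lexicographic tuple ordering over `(date, id)`, with each partition covering values strictly below its upper bound.

```python
from bisect import bisect_right
from datetime import date

# Illustrative stand-in for MIN_VALUE of the INT column (an assumption,
# not a Doris constant).
MIN_ID = float("-inf")

# Exclusive upper bounds taken from the partition listing in the diff.
# The last partition only specifies `date`, so `id` is filled with MIN_ID.
PARTITIONS = [
    ("p201701_1000", (date(2017, 2, 1), 1000)),
    ("p201702_2000", (date(2017, 3, 1), 2000)),
    ("p201703_all", (date(2017, 4, 1), MIN_ID)),
]


def find_partition(row_date, row_id):
    """Return the name of the partition whose range contains the row, or None."""
    key = (row_date, row_id)
    upper_bounds = [bound for _, bound in PARTITIONS]
    # The row belongs to the first partition whose upper bound is strictly
    # greater than the key; tuples compare column by column, left to right.
    idx = bisect_right(upper_bounds, key)
    if idx == len(PARTITIONS):
        return None  # key is at or beyond the last upper bound
    return PARTITIONS[idx][0]


# First data row shown in the diff: 2017-01-01, 200 --> p201701_1000
print(find_partition(date(2017, 1, 1), 200))
```

Under this model, a row such as `(2017-02-01, 100)` still falls in `p201701_1000`, because the `date` columns tie and `100 < 1000` decides the comparison.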