This is an automated email from the ASF dual-hosted git repository.

luzhijing pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris-website.git

The following commit(s) were added to refs/heads/master by this push:
     new 5a08e34ed9 [docs](3.0) Update typo and sidebar of 3.0 version docs (#908)
5a08e34ed9 is described below

commit 5a08e34ed91942b0908063d05fa879e83f85144c
Author: KassieZ <139741991+kass...@users.noreply.github.com>
AuthorDate: Wed Jul 24 18:48:16 2024 +0800

    [docs](3.0) Update typo and sidebar of 3.0 version docs (#908)
---
 blog/migrate-lakehouse-from-bigquery-to-doris.md   |  2 +-
 docs/data-operate/export/export-overview.md        | 41 +++++++++++-----------
 docs/practical-guide/log-storage-analysis.md       |  2 +-
 .../current/data-operate/export/export-overview.md |  5 +--
 .../practical-guide/log-storage-analysis.md        |  2 +-
 .../practical-guide/log-storage-analysis.md        |  2 +-
 .../data-operate/export/export-overview.md         |  5 +--
 .../practical-guide/log-storage-analysis.md        |  2 +-
 .../data-operate/export/export-overview.md         |  5 +--
 .../practical-guide/log-storage-analysis.md        |  2 +-
 .../practical-guide/log-storage-analysis.md        |  2 +-
 .../data-operate/export/export-overview.md         | 40 +++++++++++----------
 .../practical-guide/log-storage-analysis.md        |  2 +-
 .../data-operate/export/export-overview.md         | 41 +++++++++++-----------
 .../practical-guide/log-storage-analysis.md        |  2 +-
 versioned_sidebars/version-3.0-sidebars.json       |  7 ----
 16 files changed, 81 insertions(+), 81 deletions(-)

diff --git a/blog/migrate-lakehouse-from-bigquery-to-doris.md b/blog/migrate-lakehouse-from-bigquery-to-doris.md
index 671f85c2eb..5189770f95 100644
--- a/blog/migrate-lakehouse-from-bigquery-to-doris.md
+++ b/blog/migrate-lakehouse-from-bigquery-to-doris.md
@@ -170,7 +170,7 @@ The implementation was carried out by 1 Data Engineer, 1 Software Engineer, and
 - It supports seamless data import from Apache Iceberg. The Machine Learning and data mining team can directly import data without needing to create a separate pipeline like with BigQuery.
-- It supports vector data storage for AI chatbots. Data can be directly imported from the File Store Service (S3) instead of having to push it to Redis as before.
+- It supports [vector data storage](https://python.langchain.com/v0.2/docs/integrations/vectorstores/apache_doris/) for AI chatbots. Data can be directly imported from the File Store Service (S3) instead of having to push it to Redis as before.
 - It provides efficient data aggregation through the Rollup mechanism.
diff --git a/docs/data-operate/export/export-overview.md b/docs/data-operate/export/export-overview.md
index fc90d2b6e7..29a45cf5c1 100644
--- a/docs/data-operate/export/export-overview.md
+++ b/docs/data-operate/export/export-overview.md
@@ -86,26 +86,27 @@ Parquet and ORC file formats have their own data types. Doris's export function
 The following table shows the mapping between Doris data types and Parquet, ORC file format data types:
 1. Doris export to ORC file format data type mapping table:
- | Doris Type | Orc Type |
- | ----- | ----- |
- | boolean | boolean |
- | tinyint | tinyint |
- | smallint | smallint |
- | int | int |
- | bigint | bigint |
- | largeInt | string |
- | date | string |
- | datev2 | string |
- | datetime | string |
- | datetimev2 | timestamp |
- | float | float |
- | double | double |
- | char / varchar / string | string |
- | decimal | decimal |
- | struct | struct |
- | map | map |
- | array | array |
- |json| Not support|
+
+ |Doris Type|Orc Type|
+ | -------- | ------- |
+ |boolean|boolean|
+ |tinyint|tinyint|
+ |smallint|smallint|
+ |int|int|
+ |bigint|bigint|
+ |largeInt|string|
+ |date|string|
+ |datev2|string|
+ |datetime|string|
+ |datetimev2|timestamp|
+ |float|float|
+ |double|double|
+ |char / varchar / string|string|
+ |decimal|decimal|
+ |struct|struct|
+ |map|map|
+ |array|array|
+ |json| Not supported|
 2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is:
diff --git a/docs/practical-guide/log-storage-analysis.md b/docs/practical-guide/log-storage-analysis.md
index d1c15772e8..99d251de70 100644
--- a/docs/practical-guide/log-storage-analysis.md
+++ b/docs/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
 | Configuration fields to be optimized | Description |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
 | `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
 | `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
 | `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md
index 5fa29031fe..4a5441a640 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md
@@ -86,8 +86,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
 以下是 Apache Doris 数据类型和 Parquet、ORC 文件格式的数据类型映射关系表:
 1. Doris 导出到 Orc 文件格式的数据类型映射表:
+
 |Doris Type|Orc Type|
- | ----- | ----- |
+ | -------- | ------- |
 |boolean|boolean|
 |tinyint|tinyint|
 |smallint|smallint|
@@ -107,7 +108,7 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
 |array|array|
 |json|不支持|
- <br/>
+
 2. Apache Doris 导出到 Parquet 文件格式时,会先将 Apache Doris 内存数据转换为 Arrow 内存数据格式,然后由 Arrow 写出到 Parquet 文件格式。Apache Doris 数据类型到 Arrow 数据类的映射关系为:
 |Doris Type|Arrow Type|
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md
index c079a16380..a6c5dd1a1c 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
 | 需调整参数 | 说明 |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
 | `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
 | `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
 | `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md
index 0430c5eae8..1a358d8a3b 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
 | 需调整参数 | 说明 |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
 | `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
 | `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
 | `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md
index 76552b3edd..3e5e736f9b 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md
@@ -86,8 +86,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Doris 的导出功能能
 以下是 Doris 数据类型和 Parquet、ORC 文件格式的数据类型映射关系表:
 1. Doris 导出到 Orc 文件格式的数据类型映射表:
+
 |Doris Type|Orc Type|
- | ----- | ----- |
+ | -------- | ------- |
 |boolean|boolean|
 |tinyint|tinyint|
 |smallint|smallint|
@@ -105,8 +106,8 @@ Parquet、ORC 文件格式拥有自己的数据类型。Doris 的导出功能能
 |struct|struct|
 |map|map|
 |array|array|
+ |json|不支持|
- <br/>
 2. Doris 导出到 Parquet 文件格式时,会先将 Doris 内存数据转换为 arrow 内存数据格式,然后由 arrow 写出到 parquet 文件格式。Doris 数据类型到 arrow 数据类的映射关系为:
 |Doris Type|Arrow Type|
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md
index 424fccbb89..50f3a02571 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
 | 需调整参数 | 说明 |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
 | `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
 | `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
 | `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md
index 5fa29031fe..4a5441a640 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md
@@ -86,8 +86,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
 以下是 Apache Doris 数据类型和 Parquet、ORC 文件格式的数据类型映射关系表:
 1. Doris 导出到 Orc 文件格式的数据类型映射表:
+
 |Doris Type|Orc Type|
- | ----- | ----- |
+ | -------- | ------- |
 |boolean|boolean|
 |tinyint|tinyint|
 |smallint|smallint|
@@ -107,7 +108,7 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
 |array|array|
 |json|不支持|
- <br/>
+
 2. Apache Doris 导出到 Parquet 文件格式时,会先将 Apache Doris 内存数据转换为 Arrow 内存数据格式,然后由 Arrow 写出到 Parquet 文件格式。Apache Doris 数据类型到 Arrow 数据类的映射关系为:
 |Doris Type|Arrow Type|
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md
index c079a16380..a6c5dd1a1c 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
 | 需调整参数 | 说明 |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
 | `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
 | `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
 | `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md b/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md
index 8d096b974c..23b13a1006 100644
--- a/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md
+++ b/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
 | Configuration fields to be optimized | Description |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
 | `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
 | `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
 | `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/versioned_docs/version-2.1/data-operate/export/export-overview.md b/versioned_docs/version-2.1/data-operate/export/export-overview.md
index 3a3a2abf41..5a9897acc2 100644
--- a/versioned_docs/version-2.1/data-operate/export/export-overview.md
+++ b/versioned_docs/version-2.1/data-operate/export/export-overview.md
@@ -86,25 +86,27 @@ Parquet and ORC file formats have their own data types. Doris's export function
 The following table shows the mapping between Doris data types and Parquet, ORC file format data types:
 1. Doris export to ORC file format data type mapping table:
- | Doris Type | Orc Type |
- | ----- | ----- |
- | boolean | boolean |
- | tinyint | tinyint |
- | smallint | smallint |
- | int | int |
- | bigint | bigint |
- | largeInt | string |
- | date | string |
- | datev2 | string |
- | datetime | string |
- | datetimev2 | timestamp |
- | float | float |
- | double | double |
- | char / varchar / string | string |
- | decimal | decimal |
- | struct | struct |
- | map | map |
- | array | array |
+
+ |Doris Type|Orc Type|
+ | -------- | ------- |
+ |boolean|boolean|
+ |tinyint|tinyint|
+ |smallint|smallint|
+ |int|int|
+ |bigint|bigint|
+ |largeInt|string|
+ |date|string|
+ |datev2|string|
+ |datetime|string|
+ |datetimev2|timestamp|
+ |float|float|
+ |double|double|
+ |char / varchar / string|string|
+ |decimal|decimal|
+ |struct|struct|
+ |map|map|
+ |array|array|
+ |json| Not supported|
 2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is:
diff --git a/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md b/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md
index a834ee462f..61657d0398 100644
--- a/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md
+++ b/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
 | Configuration fields to be optimized | Description |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
 | `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
 | `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
 | `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/versioned_docs/version-3.0/data-operate/export/export-overview.md b/versioned_docs/version-3.0/data-operate/export/export-overview.md
index fc90d2b6e7..75b7247ed9 100644
--- a/versioned_docs/version-3.0/data-operate/export/export-overview.md
+++ b/versioned_docs/version-3.0/data-operate/export/export-overview.md
@@ -86,26 +86,27 @@ Parquet and ORC file formats have their own data types. Doris's export function
 The following table shows the mapping between Doris data types and Parquet, ORC file format data types:
 1. Doris export to ORC file format data type mapping table:
- | Doris Type | Orc Type |
- | ----- | ----- |
- | boolean | boolean |
- | tinyint | tinyint |
- | smallint | smallint |
- | int | int |
- | bigint | bigint |
- | largeInt | string |
- | date | string |
- | datev2 | string |
- | datetime | string |
- | datetimev2 | timestamp |
- | float | float |
- | double | double |
- | char / varchar / string | string |
- | decimal | decimal |
- | struct | struct |
- | map | map |
- | array | array |
- |json| Not support|
+
+ |Doris Type|Orc Type|
+ | -------- | ------- |
+ |boolean|boolean|
+ |tinyint|tinyint|
+ |smallint|smallint|
+ |int|int|
+ |bigint|bigint|
+ |largeInt|string|
+ |date|string|
+ |datev2|string|
+ |datetime|string|
+ |datetimev2|timestamp|
+ |float|float|
+ |double|double|
+ |char / varchar / string|string|
+ |decimal|decimal|
+ |struct|struct|
+ |map|map|
+ |array|array|
+ |json| Not supported|
 2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is:
diff --git a/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md b/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md
index d1c15772e8..99d251de70 100644
--- a/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md
+++ b/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
 | Configuration fields to be optimized | Description |
 | :----------------------------------------------------------- | :----------------------------------------------------------- |
 | `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
 | `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
 | `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
 | `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/versioned_sidebars/version-3.0-sidebars.json b/versioned_sidebars/version-3.0-sidebars.json
index d5fa282479..625bc35d8d 100644
--- a/versioned_sidebars/version-3.0-sidebars.json
+++ b/versioned_sidebars/version-3.0-sidebars.json
@@ -1552,13 +1552,6 @@
 "faq/sql-faq",
 "faq/lakehouse-faq"
 ]
- },
- {
- "type": "category",
- "label": "Release notes",
- "items": [
- "releasenotes/release-3.0.0"
- ]
 }
 ]
 }
\ No newline at end of file

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org