This is an automated email from the ASF dual-hosted git repository. morningman pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push: new 51a7b806150 [fix](outfile) add notice to outfile documents (#2081) 51a7b806150 is described below commit 51a7b806150a30e826f1beb99d9c485123967d80 Author: Tiewei Fang <ftw2...@163.com> AuthorDate: Tue Feb 25 10:56:39 2025 +0800 [fix](outfile) add notice to outfile documents (#2081) ## Versions - [ ] dev - [ ] 3.0 - [x] 2.1 - [x] 2.0 ## Languages - [x] Chinese - [x] English ## Docs Checklist - [ ] Checked by AI - [ ] Test Cases Built --- .../version-2.0/data-operate/export/outfile.md | 2 ++ .../version-2.1/data-operate/export/outfile.md | 3 +++ .../version-2.0/data-operate/export/outfile.md | 2 ++ .../version-2.1/data-operate/export/outfile.md | 16 ++++++++++------ 4 files changed, 17 insertions(+), 6 deletions(-) diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/data-operate/export/outfile.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/data-operate/export/outfile.md index b02644e24d7..d3d87e4ce37 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/data-operate/export/outfile.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/data-operate/export/outfile.md @@ -153,6 +153,8 @@ ERROR 1064 (HY000): errCode = 2, detailMessage = Open broker writer failed ... ## 注意事项 +* 当前版本 nereids 优化器不支持并发Outfile,若开启了 nereids 优化器,并发 Outfile 将直接回退老优化器,由老优化器来进行并发 Outfile 导出。同时,当前版本 pipeline 引擎也不支持并发 Outfile,若开启了 pipeline 引擎, 并发 Outfile 将回退到单并发导出。 + * 如果不开启并发导出,查询结果是由单个 BE 节点,单线程导出的。因此导出时间和导出结果集大小正相关。开启并发导出可以降低导出的时间。 * 导出命令不会检查文件及文件路径是否存在。是否会自动创建路径、或是否会覆盖已存在文件,完全由远端存储系统的语义决定。 diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/outfile.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/outfile.md index c2ac0439b9b..3a3db697050 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/outfile.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/outfile.md @@ -229,6 +229,9 @@ PROPERTIES( ## 注意事项 +- 使用并发 Outfile 时的限制 + 当前版本 pipeline 引擎不支持并发Outfile。所以若开启了 pipeline 引擎, 并发 Outfile 将回退到单并发导出。 + - 导出数据量和导出效率 `SELECT INTO OUTFILE`功能本质上是执行一个 SQL 查询命令。如果不开启并发导出,查询结果是由单个 BE 节点,单线程导出的,因此整个导出的耗时包括查询本身的耗时和最终结果集写出的耗时。开启并发导出可以降低导出的时间。 diff --git a/versioned_docs/version-2.0/data-operate/export/outfile.md b/versioned_docs/version-2.0/data-operate/export/outfile.md index 8aac8895818..68c36a39bc0 100644 --- a/versioned_docs/version-2.0/data-operate/export/outfile.md +++ b/versioned_docs/version-2.0/data-operate/export/outfile.md @@ -158,6 +158,8 @@ ERROR 1064 (HY000): errCode = 2, detailMessage = Open broker writer failed ... ## Notice +* The current version of the Nereids optimizer does not support concurrent Outfile. If the Nereids optimizer is enabled, concurrent Outfile will directly revert to the old optimizer, which will handle the concurrent Outfile export. Similarly, the current version of the pipeline engine also does not support concurrent Outfile. If the pipeline engine is enabled, concurrent Outfile will revert to single-threaded export. + * The CSV format does not support exporting binary types, such as BITMAP and HLL types. These types will be output as `\N`, which is null. * If you do not enable concurrent export, the query result is exported by a single BE node in a single thread. Therefore, the export time and the export result set size are positively correlated. Turning on concurrent export can reduce the export time. diff --git a/versioned_docs/version-2.1/data-operate/export/outfile.md b/versioned_docs/version-2.1/data-operate/export/outfile.md index b66772a0823..8d774d8fa32 100644 --- a/versioned_docs/version-2.1/data-operate/export/outfile.md +++ b/versioned_docs/version-2.1/data-operate/export/outfile.md @@ -231,29 +231,33 @@ Since `"max_file_size" = "2048MB"` is specified, if the final generated file is ## Notice -1. Export Data Volume and Export Efficiency +1. Limitations when using concurrent Outfile + + The current version of the pipeline engine does not support concurrent Outfile. Therefore, if the pipeline engine is enabled, concurrent Outfile will revert to single-threaded export. + +2. Export Data Volume and Export Efficiency The `SELECT INTO OUTFILE` function is essentially executing an SQL query command. If concurrent exports are not enabled, the query results are exported by a single BE node in a single thread. Therefore, the entire export time includes the time consumed by the query itself and the time consumed by writing out the final result set. Enabling concurrent exports can reduce the export time. -2. Export Timeout +3. Export Timeout The timeout period of the export command is the same as that of the query. If the export data times out due to a large amount of data, you can set the session variable `query_timeout` to appropriately extend the query timeout period. -3. Management of Exported Files +4. Management of Exported Files Doris does not manage the exported files. Whether the files are successfully exported or left over after a failed export, users need to handle them on their own. In addition, the `SELECT INTO OUTFILE` command does not check whether files or file paths exist. Whether the `SELECT INTO OUTFILE` command will automatically create paths or overwrite existing files is completely determined by the semantics of the remote storage system. -4. If the Query Result Set Is Empty +5. If the Query Result Set Is Empty For an export with an empty result set, an empty file will still be generated. -5. File Splitting +6. File Splitting File splitting ensures that a row of data is completely stored in a single file. Therefore, the size of the file is not strictly equal to `max_file_size`. -6. Functions with Non-visible Characters +7. Functions with Non-visible Characters For some functions whose output is non-visible characters, such as BITMAP and HLL types, when exported to the CSV file format, the output is `\N`. --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org