This is an automated email from the ASF dual-hosted git repository. dataroaring pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push: new 0c54175298 [doc](flink) update flink connector faq (#1111) 0c54175298 is described below commit 0c54175298972b6622dffcb1eb3ca82bc048902b Author: wudi <676366...@qq.com> AuthorDate: Tue Oct 29 10:08:51 2024 +0800 [doc](flink) update flink connector faq (#1111) # Versions - [ ] dev - [ ] 3.0 - [ ] 2.1 - [ ] 2.0 # Languages - [ ] Chinese - [ ] English --- common_docs_zh/ecosystem/flink-doris-connector.md | 7 +++++-- ecosystem/flink-doris-connector.md | 6 +++++- 2 files changed, 10 insertions(+), 3 deletions(-) diff --git a/common_docs_zh/ecosystem/flink-doris-connector.md b/common_docs_zh/ecosystem/flink-doris-connector.md index 02d9d60282..2fe2f9eb1a 100644 --- a/common_docs_zh/ecosystem/flink-doris-connector.md +++ b/common_docs_zh/ecosystem/flink-doris-connector.md @@ -838,7 +838,7 @@ Connector1.1.0 版本以前,是攒批写入的,写入均是由数据驱动 9. **tablet writer write failed, tablet_id=190958, txn_id=3505530, err=-235** -通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和 sink.batch.interval 参数来降低 Streamload 的频率。 +通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和 sink.batch.interval 参数来降低 Streamload 的频率。在Connector1.1.0之后,默认写入时机是由Checkpoint控制,可以通过增加Checkpoint间隔来降低写入频率。 10. **Flink 导入有脏数据,如何跳过?** @@ -865,6 +865,9 @@ Flink 在数据导入时,如果有脏数据,比如字段格式、长度等 16. **如果使用整库同步 MySQL 数据到 Doris,出现 timestamp 类型与源数据相差多个小时** - 整库同步默认 timezone="UTC+8",如果你同步的数据不是该时区,可以尝试如下设置相对应的时区,例如:`--mysql-conf debezium.date.format.timestamp.zone="UTC+3"` 来解决。 +17. **攒批写入和流式写入有什么区别** + +Connector1.5.0 之后支持攒批写入,攒批写入不依赖 Checkpoint,将数据缓存在内存中,根据 sink.buffer-flush.max-rows/sink.buffer-flush.max-bytes/sink.buffer-flush.interval 参数来控制写入时机。流式写入必须开启 Checkpoint,在整个 Checkpoint 期间持续的将上游数据写入到 Doris 中,不会一直将数据缓存在内存中。 + diff --git a/ecosystem/flink-doris-connector.md b/ecosystem/flink-doris-connector.md index 9089f04153..9e7f0aa623 100644 --- a/ecosystem/flink-doris-connector.md +++ b/ecosystem/flink-doris-connector.md @@ -830,7 +830,7 @@ Before Connector1.1.0, it was written in batches, and the writing was driven by 9. **tablet writer write failed, tablet_id=190958, txn_id=3505530, err=-235** -It usually occurs before Connector1.1.0, because the writing frequency is too fast, resulting in too many versions. The frequency of Streamload can be reduced by setting the sink.batch.size and sink.batch.interval parameters. +It usually occurs before Connector1.1.0, because the writing frequency is too fast, resulting in too many versions. The frequency of Streamload can be reduced by setting the sink.batch.size and sink.batch.interval parameters. After Connector 1.1.0, the default write timing is controlled by Checkpoint, and the write frequency can be reduced by increasing the Checkpoint interval. 10. **Flink imports dirty data, how to skip it? ** @@ -858,3 +858,7 @@ The issue may have occurred due to configuring the IP address of `be`, which is 16. **When using Flink-connector to synchronize MySQL data to Doris, there is a time difference of several hours between the timestamp.** Flink Connector synchronizes the entire database from MySQL with a default timezone of UTC+8. If your data resides in a different timezone, you can adjust it using the following configuration, for example: `--mysql-conf debezium.date.format.timestamp.zone="UTC+3"`. + +17. **What is the difference between batch writing and streaming writing** + +Connector 1.5.0 and later support batch writing. Batch writing does not rely on Checkpoint. Data is cached in memory and the writing timing is controlled according to the parameters sink.buffer-flush.max-rows/sink.buffer-flush.max-bytes/sink.buffer-flush.interval. Checkpoint must be enabled for streaming writing. During the entire Checkpoint period, upstream data is continuously written to Doris, and data is not cached in memory all the time. \ No newline at end of file --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org