This is an automated email from the ASF dual-hosted git repository. kassiez pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push: new af549bf066e [improve](data-connect)Add Version Descriptions for Arrow Flight Support in Flink/Spark Doris Connectors (#2066) af549bf066e is described below commit af549bf066e18d241b20af7ac7ce87ff1ad98390 Author: Petrichor <1401597...@qq.com> AuthorDate: Tue Feb 18 17:45:20 2025 +0800 [improve](data-connect)Add Version Descriptions for Arrow Flight Support in Flink/Spark Doris Connectors (#2066) ## Versions - [x] dev - [x] 3.0 - [x] 2.1 - [ ] 2.0 ## Languages - [x] Chinese - [x] English ## Docs Checklist - [ ] Checked by AI - [ ] Test Cases Built --- docs/db-connect/arrow-flight-sql-connect.md | 2 +- .../current/db-connect/arrow-flight-sql-connect.md | 2 +- .../version-2.1/db-connect/arrow-flight-sql-connect.md | 2 +- .../version-3.0/db-connect/arrow-flight-sql-connect.md | 2 +- versioned_docs/version-2.1/db-connect/arrow-flight-sql-connect.md | 2 +- versioned_docs/version-3.0/db-connect/arrow-flight-sql-connect.md | 2 +- 6 files changed, 6 insertions(+), 6 deletions(-) diff --git a/docs/db-connect/arrow-flight-sql-connect.md b/docs/db-connect/arrow-flight-sql-connect.md index 54382516bb9..d3d99682e7e 100644 --- a/docs/db-connect/arrow-flight-sql-connect.md +++ b/docs/db-connect/arrow-flight-sql-connect.md @@ -465,7 +465,7 @@ Compared with the traditional `jdbc:mysql` connection method, the performance te ### Spark & Flink -Arrow Flight currently has no official plan to support Spark and Flink ([GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)). Doris' own [Spark Connector](https://github.com/apache/doris-spark-connector) and [Flink Connector](https://github.com/apache/doris-flink-connector) do not currently support accessing Doris through Arrow Flight SQL. Among them, Doris Flink Connector supporting Arrow Flight SQL is under development, and it is expected to improve the reading performance [...] +Arrow Flight currently has no official plan to support Spark and Flink ([GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)). Since version 24.0.0, Doris' own [Spark Connector](https://github.com/apache/doris-spark-connector) and [Flink Connector](https://github.com/apache/doris-flink-connector) have supported accessing Doris via Arrow Flight SQL, and it is expected that this will improve the reading performance several times. The community previously referred to the open source [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connector) and used FlightClient in Spark to connect to Doris for testing. It was found that the data format conversion between Arrow and Doris Block is faster, which is 10 times the conversion speed between CSV format and Doris Block, and it has better support for complex types such as Map and Array. This is because the Arrow data format has a high compression rate and lo [...] diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/db-connect/arrow-flight-sql-connect.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/db-connect/arrow-flight-sql-connect.md index 894c69ae57d..4245a577955 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/db-connect/arrow-flight-sql-connect.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/db-connect/arrow-flight-sql-connect.md @@ -465,7 +465,7 @@ try ( ### Spark & Flink -Arrow Flight 官方目前没有支持 Spark 和 Flink 的计划(见 [GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)),[Doris Spark Connector](https://github.com/apache/doris-spark-connector) 和 [Doris Flink Connector](https://github.com/apache/doris-flink-connector) 目前还不支持通过 Arrow Flight SQL 访问 Doris。其中 Doris Flink Connector 支持 Arrow Flight SQL 正在开发中,预期能提升数倍读取性能。 +Arrow Flight 官方目前没有支持 Spark 和 Flink 的计划(见 [GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)),[Doris Spark Connector](https://github.com/apache/doris-spark-connector) 和 [Doris Flink Connector](https://github.com/apache/doris-flink-connector) 自 24.0.0 开始支持通过 Arrow Flight SQL 访问 Doris,预期能提升数倍读取性能。 社区之前参考开源的 [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connector),在 Spark 中使用 FlightClient 连接 Doris 测试,发现 Arrow 与 Doris Block 之间数据格式转换的速度更快,是 CSV 格式与 Doris Block 之间转换速度的 10 倍,而且对 Map,Array 等复杂类型的支持更好,这是因为 Arrow 数据格式的压缩率高,传输时网络开销小。不过目前 Doris Arrow Flight 还没有实现多节点并行读取,仍是将查询结果汇总到一台 BE 节点后返回,对简单的批量导出数据而言,性能可能没有 Doris Spark Connector 快,后者支持 Tablet 级别的并行读取。如果你希望在 Spark 使用 Arrow Flight SQL 连接 Doris,可以参考开源的 [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connec [...] diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/db-connect/arrow-flight-sql-connect.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/db-connect/arrow-flight-sql-connect.md index e10c601ac7d..839f585c745 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/db-connect/arrow-flight-sql-connect.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/db-connect/arrow-flight-sql-connect.md @@ -465,7 +465,7 @@ try ( ### Spark & Flink -Arrow Flight 官方目前没有支持 Spark 和 Flink 的计划(见 [GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)),[Doris Spark Connector](https://github.com/apache/doris-spark-connector) 和 [Doris Flink Connector](https://github.com/apache/doris-flink-connector) 目前还不支持通过 Arrow Flight SQL 访问 Doris。其中 Doris Flink Connector 支持 Arrow Flight SQL 正在开发中,预期能提升数倍读取性能。 +Arrow Flight 官方目前没有支持 Spark 和 Flink 的计划(见 [GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)),[Doris Spark Connector](https://github.com/apache/doris-spark-connector) 和 [Doris Flink Connector](https://github.com/apache/doris-flink-connector) 自 24.0.0 开始支持通过 Arrow Flight SQL 访问 Doris,预期能提升数倍读取性能。 社区之前参考开源的 [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connector),在 Spark 中使用 FlightClient 连接 Doris 测试,发现 Arrow 与 Doris Block 之间数据格式转换的速度更快,是 CSV 格式与 Doris Block 之间转换速度的 10 倍,而且对 Map,Array 等复杂类型的支持更好,这是因为 Arrow 数据格式的压缩率高,传输时网络开销小。不过目前 Doris Arrow Flight 还没有实现多节点并行读取,仍是将查询结果汇总到一台 BE 节点后返回,对简单的批量导出数据而言,性能可能没有 Doris Spark Connector 快,后者支持 Tablet 级别的并行读取。如果你希望在 Spark 使用 Arrow Flight SQL 连接 Doris,可以参考开源的 [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connec [...] diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/db-connect/arrow-flight-sql-connect.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/db-connect/arrow-flight-sql-connect.md index e10c601ac7d..839f585c745 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/db-connect/arrow-flight-sql-connect.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/db-connect/arrow-flight-sql-connect.md @@ -465,7 +465,7 @@ try ( ### Spark & Flink -Arrow Flight 官方目前没有支持 Spark 和 Flink 的计划(见 [GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)),[Doris Spark Connector](https://github.com/apache/doris-spark-connector) 和 [Doris Flink Connector](https://github.com/apache/doris-flink-connector) 目前还不支持通过 Arrow Flight SQL 访问 Doris。其中 Doris Flink Connector 支持 Arrow Flight SQL 正在开发中,预期能提升数倍读取性能。 +Arrow Flight 官方目前没有支持 Spark 和 Flink 的计划(见 [GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)),[Doris Spark Connector](https://github.com/apache/doris-spark-connector) 和 [Doris Flink Connector](https://github.com/apache/doris-flink-connector) 自 24.0.0 开始支持通过 Arrow Flight SQL 访问 Doris,预期能提升数倍读取性能。 社区之前参考开源的 [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connector),在 Spark 中使用 FlightClient 连接 Doris 测试,发现 Arrow 与 Doris Block 之间数据格式转换的速度更快,是 CSV 格式与 Doris Block 之间转换速度的 10 倍,而且对 Map,Array 等复杂类型的支持更好,这是因为 Arrow 数据格式的压缩率高,传输时网络开销小。不过目前 Doris Arrow Flight 还没有实现多节点并行读取,仍是将查询结果汇总到一台 BE 节点后返回,对简单的批量导出数据而言,性能可能没有 Doris Spark Connector 快,后者支持 Tablet 级别的并行读取。如果你希望在 Spark 使用 Arrow Flight SQL 连接 Doris,可以参考开源的 [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connec [...] diff --git a/versioned_docs/version-2.1/db-connect/arrow-flight-sql-connect.md b/versioned_docs/version-2.1/db-connect/arrow-flight-sql-connect.md index 54382516bb9..d3d99682e7e 100644 --- a/versioned_docs/version-2.1/db-connect/arrow-flight-sql-connect.md +++ b/versioned_docs/version-2.1/db-connect/arrow-flight-sql-connect.md @@ -465,7 +465,7 @@ Compared with the traditional `jdbc:mysql` connection method, the performance te ### Spark & Flink -Arrow Flight currently has no official plan to support Spark and Flink ([GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)). Doris' own [Spark Connector](https://github.com/apache/doris-spark-connector) and [Flink Connector](https://github.com/apache/doris-flink-connector) do not currently support accessing Doris through Arrow Flight SQL. Among them, Doris Flink Connector supporting Arrow Flight SQL is under development, and it is expected to improve the reading performance [...] +Arrow Flight currently has no official plan to support Spark and Flink ([GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)). Since version 24.0.0, Doris' own [Spark Connector](https://github.com/apache/doris-spark-connector) and [Flink Connector](https://github.com/apache/doris-flink-connector) have supported accessing Doris via Arrow Flight SQL, and it is expected that this will improve the reading performance several times. The community previously referred to the open source [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connector) and used FlightClient in Spark to connect to Doris for testing. It was found that the data format conversion between Arrow and Doris Block is faster, which is 10 times the conversion speed between CSV format and Doris Block, and it has better support for complex types such as Map and Array. This is because the Arrow data format has a high compression rate and lo [...] diff --git a/versioned_docs/version-3.0/db-connect/arrow-flight-sql-connect.md b/versioned_docs/version-3.0/db-connect/arrow-flight-sql-connect.md index 54382516bb9..d3d99682e7e 100644 --- a/versioned_docs/version-3.0/db-connect/arrow-flight-sql-connect.md +++ b/versioned_docs/version-3.0/db-connect/arrow-flight-sql-connect.md @@ -465,7 +465,7 @@ Compared with the traditional `jdbc:mysql` connection method, the performance te ### Spark & Flink -Arrow Flight currently has no official plan to support Spark and Flink ([GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)). Doris' own [Spark Connector](https://github.com/apache/doris-spark-connector) and [Flink Connector](https://github.com/apache/doris-flink-connector) do not currently support accessing Doris through Arrow Flight SQL. Among them, Doris Flink Connector supporting Arrow Flight SQL is under development, and it is expected to improve the reading performance [...] +Arrow Flight currently has no official plan to support Spark and Flink ([GitHub Issue](https://github.com/apache/arrow-adbc/issues/1490)). Since version 24.0.0, Doris' own [Spark Connector](https://github.com/apache/doris-spark-connector) and [Flink Connector](https://github.com/apache/doris-flink-connector) have supported accessing Doris via Arrow Flight SQL, and it is expected that this will improve the reading performance several times. The community previously referred to the open source [Spark-Flight-Connector](https://github.com/qwshen/spark-flight-connector) and used FlightClient in Spark to connect to Doris for testing. It was found that the data format conversion between Arrow and Doris Block is faster, which is 10 times the conversion speed between CSV format and Doris Block, and it has better support for complex types such as Map and Array. This is because the Arrow data format has a high compression rate and lo [...] --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org