It's hard to believe we've made such great progress in just one year! We'll
keep up the pace and continue working hard to make Cloudberry the best
PostgreSQL-native MPP database.

Best,
Lirong


Dianjin Wang <[email protected]> 于2025年10月22日周三 14:37写道:

> Hi all,
>
> It has been one year since Apache Cloudberry officially joined the
> Apache Incubator — a great time to look back on our journey so far!
>
> Our Apache Cloudberry Roadmap was the first large open discussion we
> launched after entering the Incubator, aiming to gather community
> feedback and align on development priorities. Now, after a year of
> active collaboration and contributions from many community members,
> we’d like to share a brief recap of our progress.
>
> 1. Cherry-pick from Greenplum to Cloudberry (Highest Priority)
>
> Now this task is almost done. You can track the progress here:
> https://lists.apache.org/thread/bf4n0p6jt8x2wnsmgwqwmqqboy4kq0st
>
> 2. PostgreSQL Kernel Upgrade
>
> PostgreSQL 14 ~> PostgreSQL 16 kernel upgrade work is in progress:
> https://lists.apache.org/thread/1b5sr96315txsvs1zg65vsd1n01kf0ql
>
> 3. Performance and Usability
>
> a) Support hybrid Row-Column storage, inspired by Partition Attributes
> Across (https://www.vldb.org/conf/2001/P169.pdf), which has the same
> write performance as AO tables and the same read performance as AOCS
> tables. We will also integrate the latest compression algorithms and
> encoding algorithms (such as dictionary encoding) into it.
>   - This has been done, see
> https://github.com/apache/cloudberry/tree/main/contrib/pax_storage
> b) Refactor the Materialized view and query for external tables.
> c) Support parallel execution in ORCA.
>   -  See https://github.com/apache/cloudberry/pull/1398
> d) Parallel query optimization to support more SQL operators.
>   - See https://github.com/apache/cloudberry/pull/1261
>
> 4. Availability Improvements
>
> a) Support hot (read-only) standby
>   - See https://github.com/apache/cloudberry/pull/1268
> b) Robust resource groups isolation - IO/CPU/Memory/Network
>   - Already supported in Kernel
>
> 5. Functionality Improvements
>
> * Pg_hint_plan for ORCA: done.
>
> 6. Streaming / Real-time
>
> a) mplementing kafka_fdw extension to enable streaming data from Kafka
> to Cloudberry.
>   - See https://github.com/cloudberry-contrib/kafka_fdw
> b) Integration with Flink CDC / Kafka connector to support near
> real-time data integration.
>   - See
>     - Flink Connector JDBC -
>
> https://github.com/apache/flink-connector-jdbc/commit/544275c8c8b03426b71192b0dde39bc51c041bab
> c) Support Dynamic Tables.
>   - See https://cloudberry.apache.org/docs/performance/use-dynamic-tables
>
> 7. Utilities and Ecosystem
>
> a) Cherry-pick the latest commits from the original Greenplum projects
> to Cloudberry, including cloudberry-pxf, cloudberry-gpbackup,
> cloudberry-gpbackup-s3-plugin, cloudberry-go-libs.
>   - cloudberry-gpbackup has been renamed to cloudberry-backup, its
> codebase has synced with the GP’s archived version:
> https://github.com/apache/cloudberry-backup
>   - cloudberry-go-libs: cloudberry-go-libs has synced with the GP’s
> archived version: https://github.com/apache/cloudberry-go-libs
>   - cloudberry-gpbackup-s3-plugin: this repo has been archived and its
> core files are merged into the cloudberry-backup:
> https://github.com/apache/cloudberry-backup/tree/main/plugins/s3plugin
>   - cloudberry-pxf: still in progress on the archived commits sync to
> Cloudberry.
> b) Support PGRX to support writing UDFs in Rust in Cloudberry.
>   - See https://github.com/cloudberry-contrib/pgrx
> c) DBeaver for Cloudberry
>   - It has supported Cloudberry since its 25.2.2:
> https://github.com/dbeaver/dbeaver/releases
> d) JDBC/ODBC for Cloudberry
>   - we can use the PostgreSQL JDBC/ODBC drive for Cloudberry
> e) Integrations with other ASF projects
>   - Apache SeaTunnel (done)
>   - Apache MADlib (WIP)
>
> 8. Release Management
>
> a) First Apache release
>   - the first Apache release can be downloaded here:
> https://cloudberry.apache.org/releases
>
> b) Release Process
>   - Documented release procedures following Apache guidelines
>     - See https://github.com/apache/cloudberry/wiki
>   - Automated release preparation and verification tools
>     - See https://github.com/apache/cloudberry/tree/main/devops/release
>   - Release notes and migration guides for each version
>   - Security vulnerability handling process
>     - See https://github.com/apache/cloudberry/blob/main/SECURITY.md
>
> c) Pipelines
>   -  Introduce the new build, test, and deployment workflows for
> Cloudberry based on GitHub Actions and Docker.
>     - See https://github.com/apache/cloudberry/tree/main/.github/workflows
>
> 9. Website, Documents, and Marketing
>
> We’ve also made steady progress on documentation, website content, and
> community outreach, strengthening Cloudberry’s visibility and
> engagement in both the Apache and PostgreSQL ecosystems.
>
> Note: if something is ignored, welcome to have your comments.
>
> ~~~
>
> This milestone would not have been possible without the help of our
> community contributors, mentors, and everyone who has been part of
> this journey.
>
> Thank you all for your continued support and contributions! Let’s keep
> up the great work and make Cloudberry even better.
>
>
> Best,
> Dianjin Wang
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to