Thank you for your support, JB. I've discussed this with the team, and we're all delighted to welcome you as a mentor for the project. I've also updated the proposal [1] accordingly to reflect your involvement.
Best Regards, Yu [1] https://cwiki.apache.org/confluence/display/INCUBATOR/FlussProposal On Mon, 26 May 2025 at 16:10, Jean-Baptiste Onofré <j...@nanthrax.net> wrote: > Hi > > Fluss is a very interesting project. I would be happy to help ! > > Regards > JB > > On Wed, May 21, 2025 at 10:43 AM Yu Li <car...@gmail.com> wrote: > > > > Hi All, > > > > > > I would like to propose Fluss [1] as a new apache incubator project, and > > you can find the proposal [2] of Fluss for more details. > > > > > > Fluss is a distributed storage service designed to deliver high > throughput > > and sub-second latency for streaming read and write operations. It aims > to > > provide a unified data layer that bridges real-time processing with data > > lakehouse architectures. Building real-time analytics pipelines on top > of a > > data lakehouse requires key capabilities such as tabular query support, > > efficient data updates, changelog subscriptions, and the ability to > > periodically snapshot data into lake file formats like Apache Iceberg and > > Apache Paimon — functionalities that existing message queue systems such > as > > Apache Kafka are not well suited to address. > > > > > > To tackle these challenges, Fluss offers the following features: > > > > > > 1. *Table-Oriented Data Model, Not Topics.* Unlike traditional messaging > > systems that rely on topics, Fluss treats tables as first-class citizens, > > aligning its data model with that of modern data lakehouses. > > > > 2. *Columnar Stream Storage.* By storing streaming data in a columnar > > format (specifically Apache Arrow), Fluss achieves up to 10x faster read > > performance for analytical queries over streaming data. > > > > 3. *Real-Time Updates and Changelog Subscription.* Fluss natively > supports > > data updates and generates fine-grained changelogs, enabling low-latency > > incremental stream processing and state synchronization. > > > > 4. *Streaming & Lakehouse Unification.* Fluss enhances the stream > > processing capabilities of lakehouse architectures by seamlessly > supporting > > both real-time ingestion and historical analysis within a single system. > > > > > > Fluss is currently deployed in production environments at Alibaba and > many > > other companies, where it has reduced total operational costs by up to > 80% > > compared to traditional message queue systems in a variety of use cases. > In > > addition, the project has gained traction in the open-source community, > > with active adoption from organizations such as ByteDance, AntGroup, > > Ververica, eBay, Dynatrace, and Dream11. Many of these users have also > > contributed code and improvements, helping to form a vibrant and growing > > community with dozens of active developers. > > > > > > The proposed initial committers are eager to join the Apache Software > > Foundation (ASF) to foster broader collaboration and further strengthen > the > > community. We believe that bringing Fluss into the Apache Incubator will > > unlock significant value for the broader open-source ecosystem. > > > > > > I am honored to serve as the champion for this project and will mentor it > > alongside three additional mentors (many thanks to them all): > > > > > > * Becket Qin (j...@apache.org) > > > > * Jingsong Lee (lzljs3620...@apache.org) > > > > * Zili (Tison) Chen (ti...@apache.org) > > > > > > Look forward to your feedback. Thanks. > > > > Best Regards, > > Yu > > > > [1] https://github.com/alibaba/fluss > > > > [2] https://cwiki.apache.org/confluence/display/INCUBATOR/FlussProposal > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >