Hi All,

I would like to propose Fluss [1] as a new Apache Incubator project. The
full proposal [2] has more details.


Fluss is a distributed storage service designed to deliver high throughput
and sub-second latency for streaming read and write operations. It aims to
provide a unified data layer that bridges real-time processing with data
lakehouse architectures. Building real-time analytics pipelines on top of a
data lakehouse requires key capabilities such as tabular query support,
efficient data updates, changelog subscriptions, and the ability to
periodically snapshot data into lake table formats like Apache Iceberg and
Apache Paimon. Existing message queue systems such as Apache Kafka are not
well suited to provide these capabilities.


To tackle these challenges, Fluss offers the following features:


1. *Table-Oriented Data Model, Not Topics.* Unlike traditional messaging
systems that rely on topics, Fluss treats tables as first-class citizens,
aligning its data model with that of modern data lakehouses.

2. *Columnar Stream Storage.* By storing streaming data in a columnar
format (specifically Apache Arrow), Fluss achieves up to 10x faster read
performance for analytical queries over streaming data.

3. *Real-Time Updates and Changelog Subscription.* Fluss natively supports
data updates and generates fine-grained changelogs, enabling low-latency
incremental stream processing and state synchronization.

4. *Streaming & Lakehouse Unification.* Fluss enhances the stream
processing capabilities of lakehouse architectures by seamlessly supporting
both real-time ingestion and historical analysis within a single system.


Fluss is currently deployed in production environments at Alibaba and many
other companies, where it has reduced total operational costs by up to 80%
compared to traditional message queue systems in a variety of use cases. In
addition, the project has gained traction in the open-source community,
with active adoption from organizations such as ByteDance, Ant Group,
Ververica, eBay, Dynatrace, and Dream11. Many of these users have also
contributed code and improvements, helping to form a vibrant and growing
community with dozens of active developers.


The proposed initial committers are eager to join the Apache Software
Foundation (ASF) to foster broader collaboration and further strengthen the
community. We believe that bringing Fluss into the Apache Incubator will
unlock significant value for the broader open-source ecosystem.


I am honored to serve as the champion for this project and will mentor it
alongside three additional mentors (many thanks to them all):


* Becket Qin (j...@apache.org)

* Jingsong Lee (lzljs3620...@apache.org)

* Zili (Tison) Chen (ti...@apache.org)


I look forward to your feedback. Thanks.

Best Regards,
Yu

[1] https://github.com/alibaba/fluss

[2] https://cwiki.apache.org/confluence/display/INCUBATOR/FlussProposal
