Guozhang Wang created KAFKA-6038:
------------------------------------
Summary: Repartition topics could be much more transient
Key: KAFKA-6038
URL: https://issues.apache.org/jira/browse/KAFKA-6038
Project: Kafka
Issue Type: Improvement
Components: streams
Reporter: Guozhang Wang
Unlike changelog topics, the repartition topics could just be short-lived than
eating up the storage space on Kafka brokers. Today users have different ways
to configure them with short retention such as enforce a retention of 30
minutes with small log segment sizes, or use AppendTime for repartition topics.
All these would be cumbersome and Streams should just do this automatically.
One way to do it is use the “purgeData” admin API (KIP-107) such that after the
offset of the input topics are committed, if the input topics are actually
repartition topics, we would purge the data immediately. One tricky thing to
consider though, is upon (re-)starting the application, if the repartition
topics are used for restoring the states, we need to re-fill these topics in
the right way in order for restoration purposes, and there might be some devils
in the implementation details.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)