navina opened a new issue, #10996:
URL: https://github.com/apache/pinot/issues/10996

   Issue is to open-up a discussion around:
   1. Why and in what use-cases, does it make sense to use the stream level 
consumption model in Pinot ?
   2. What are the semantics offered by the stream level consumption model. Eg. 
how does data from the source get partitioned into Pinot tables ? How is the 
consumption monitored in this model?  Iiuc, segment name convention is also 
different? 
   3.  Some feature differences I have noticed are (please correct, if I am 
mistaken). I am sure there are more. 
   
    Feature | HLC | LLC 
   ---|---|---
   Force commit | No | Yes 
   Stream Message metadata extraction | No (can potentially be extended) | Yes 
   Ingestion throttling | No | Yes
   
   4. Documentation is sparse about this usage and its guarantees. Iirc, there 
were a few examples in OSS documentation which used high level consumer. Users 
have mistakenly used these samples with `ConsumerType.HIGHLEVEL` and ended up 
in long debugging sessions. One example is 
https://apache-pinot.slack.com/archives/CDRCA57FC/p1687987849496959?thread_ts=1687912445.703689&cid=CDRCA57FC.
 (when the original incident happened, we spent ~1-2 days debugging before 
realizing that the stream type is high level)
   
   I would like to propose that we find a path to sunset the stream level 
consumption model. but I don't want to proceed without understanding the above 
questions. Please help clarify. 
   
   I also see comments like "This can be removed once we remove HLC 
implementation from the code" 
[link](https://github.com/apache/pinot/blob/master/pinot-spi/src/main/java/org/apache/pinot/spi/stream/PartitionLevelStreamConfig.java#L30)
  . So, I am assuming this topic has come up before for discussion :) 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to