Abhishek01911 opened a new issue, #17045: URL: https://github.com/apache/pinot/issues/17045
Justification -- The ensureAllPartitionsConsuming function validates and repairs LLC (Low-Level Consumer) realtime segments to ensure all stream partitions have active consuming segments. It's called by the RealtimeSegmentValidationManager periodic task to detect and fix stuck partitions. Currently we have seen if a topic gets deleted for multi topic table, it causes ingestion stop for the all the topics of the table. This happens due to uncaught exception in [code](https://github.com/apache/pinot/blob/release-1.3.0-rc/pinot-controller/src/main/java/org/apache/pinot/controller/helix/core/realtime/PinotLLCRealtimeSegmentManager.java#L944C15-L944C43). Here we try to fetch partition count for deleted kafka topic, and hence it results in an exception. Catching this exception would prevent complete degradation and hence this exception should be caught and emitted as an metric, based on it we can alert the end user. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
