rohityadav1993 opened a new pull request, #13837: URL: https://github.com/apache/pinot/pull/13837
`feature` `release-notes`

Follow-up to #13107.

This PR adds support for uploading batch-generated segments to an upsert table. A new segment name convention, `UploadedRealtimeSegmentName`, was introduced, which enables segment assignment for uploaded segments (those not generated by stream consumers) based on the provided partition number.

Changes:
- Adds a new config in BatchConfigMap needed for `UploadedRealtimeSegmentNameGenerator`
- Adds a new integration test for upsert tables
- Refactors the README with a quick-start example

Test plan:
- Integration test
- Cluster test:
  - Set up a new upsert table with keys partitioned using murmur2
  - Generate segments from a Flink job and upload them to the table
  - Validation:
    - Check that the keys of an uploaded segment are partitioned consistently with the segments generated from the realtime stream
    - <img width="760" alt="Screenshot 2024-08-19 at 2 26 49 PM" src="https://github.com/user-attachments/assets/56466699-cfc9-4155-b0e1-07ac04fcc02e">

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org