rohityadav1993 commented on PR #13837:
URL: https://github.com/apache/pinot/pull/13837#issuecomment-2323451696

   @kishoreg , extracting partitionId from segmentName is a reuse of 
[LLCSegmentName](https://github.com/apache/pinot/blob/master/pinot-common/src/main/java/org/apache/pinot/common/utils/LLCSegmentName.java)
 convention in UploadedReatimeSegmentName introduced in PR #13107
   A call was taken to utilise segmentName encoding to reduce overhead of 
querying Zk.
   
   Yes, we can support batch-only upsert table. I had a few top of my thoughts:
    - Upsert table is essentially a realtime table and canont be created 
without a stream config.
    - Offline upsert tables would the right classification though behind the 
scenes it can be a realtime table with disabled stream consumption.
    - Many batch workloads have the possibility to deduplicate data during 
offline ingestion so offline tables are still usable if we overwrite all the 
data. Real advantage will be for usecases with incremental data ingestion with 
upsert requirement.
   
   I anticipate a few usecases coming up in future for us and will be happy to 
enhance batch ingestion for upserts.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to