shuitai commented on issue #10896: URL: https://github.com/apache/pinot/issues/10896#issuecomment-1962762566
@chenboat If there are duplicated records in batch ingestion, upsert feature could dedup them. If overwriting batch data, the client needs to dedup by spark, flink or other tools. The upsert is the most advantage of pinot compared to druid, if do upsert in pinot, it could help customer save ETL costs. So upsert feature in batch is also very useful. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org