Re: [I] Detecting duplicates in the Flink Data Stream API [iceberg]

2025-02-02 Thread via GitHub
github-actions[bot] commented on issue #10683: URL: https://github.com/apache/iceberg/issues/10683#issuecomment-2629642189 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache

Re: [I] Detecting duplicates in the Flink Data Stream API [iceberg]

2025-02-02 Thread via GitHub
github-actions[bot] closed issue #10683: Detecting duplicates in the Flink Data Stream API URL: https://github.com/apache/iceberg/issues/10683

Re: [I] Detecting duplicates in the Flink Data Stream API [iceberg]

2024-07-22 Thread via GitHub
pvary commented on issue #10683: URL: https://github.com/apache/iceberg/issues/10683#issuecomment-2242830276 Thanks @lkokhreidze! Currently there is no such thing in the Flink Iceberg Sink. You need to build your own operator for it.
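
As a rough illustration of what such a custom operator would have to do, here is a minimal sketch of "first row per key wins" de-duplication in plain Java. It is not Flink or Iceberg API: the class `FirstRowDedup` and the method `keepFirst` are hypothetical names, and the in-memory `HashSet` stands in for whatever state the real operator would keep. In an actual Flink job the same check would presumably live in keyed state (for example a `ValueState` flag inside a `KeyedProcessFunction`) placed ahead of the sink.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Function;

// Hypothetical helper, not part of Flink or Iceberg.
public class FirstRowDedup {

    /**
     * Returns the records in arrival order, keeping only the first
     * record seen for each key and dropping later ones with the same key.
     */
    public static <T, K> List<T> keepFirst(List<T> records, Function<T, K> keyOf) {
        Set<K> seenKeys = new HashSet<>();   // stands in for per-key operator state
        List<T> firstRows = new ArrayList<>();
        for (T record : records) {
            // Set.add returns false when the key was already present.
            if (seenKeys.add(keyOf.apply(record))) {
                firstRows.add(record);
            }
        }
        return firstRows;
    }
}
```

The sketch only covers duplicates within the records it has seen; a key that is already present in the table from an earlier run would need a separate lookup or persisted state.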

Re: [I] Detecting duplicates in the Flink Data Stream API [iceberg]

2024-07-22 Thread via GitHub
lkokhreidze commented on issue #10683: URL: https://github.com/apache/iceberg/issues/10683#issuecomment-2242798889 Hi @pvary, thanks for the reply. I do not know the internals of Paimon, or whether it inserts both rows. But from the reader's perspective, only the first row will be visible. Behav

Re: [I] Detecting duplicates in the Flink Data Stream API [iceberg]

2024-07-19 Thread via GitHub
lkokhreidze commented on issue #10683: URL: https://github.com/apache/iceberg/issues/10683#issuecomment-2239706594 Hi @pvary, thanks for the answer. Apologies, I now realise the question might have been a bit misleading. What I am trying to do is to have the first row of the same primar

[I] Detecting duplicates in the Flink Data Stream API [iceberg]

2024-07-11 Thread via GitHub
lkokhreidze opened a new issue, #10683: URL: https://github.com/apache/iceberg/issues/10683 ### Query engine Flink ### Question Hi, I was wondering if there is a way to detect whether the batch currently being written to the Iceberg table would perform an upsert? Context: