lkindere opened a new issue, #12549:
URL: https://github.com/apache/iceberg/issues/12549

   ### Query engine
   
   Kafka Connect IcebergSinkConnector
   
   ### Question
   
   Hello,
   
   Every so often I noticed that the IcebergSinkConnector is missing some 
records which can be seen when when querying the iceberg tables.
   
   For example we store the Kafka partition and Kafka offset in the tables for 
debugging.
   
   And with this query:
   SELECT * FROM <table>
   WHERE
     __kafka_metadata.__kafka_partition = 1
   AND
     __kafka_metadata.__kafka_offset >= 175807
   AND
     __kafka_metadata.__kafka_offset <= 175809
   
   We would expect to see 3 rows, with offsets 175807, 175808 and 175809, 
however the record with the offset 175808 is missing and we only see 175807 and 
175809.
   
   Checking the logs by ingestion time we do not see any abnormalities or 
errors.
   
   Has anyone run into something similar before and knows what may be the cause?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to