anirbanroyeasygo commented on issue #4457: URL: https://github.com/apache/iceberg/issues/4457#issuecomment-2620360127
I am getting a similar issue consuming from kinesis and merging into an iceberg table. My spark structured streaming code works when its inserting the data into a new file but whenever it tries to access an older partition(not caused by s3 eventual consistency issue), it fails with the following issue: ``` 2025-01-29 00:36:30,223 ERROR [Executor task launch worker for task 16177.0 in stage 31.0 (TID 68551)] source.BaseDataReader (BaseDataReader.java:next(120)): Error reading file: s3://path/00198-11348490-64a86129-f447-4497-8898-5bbaf58d64f4-00001.parquetorg.apache.iceberg.exceptions.RuntimeIOException: javax.net.ssl.SSLException: Connection reset at org.apache.iceberg.parquet.VectorizedParquetReader$FileIterator.advance(VectorizedParquetReader.java:165) ~[iceberg-spark-runtime-3.3_2.12-1.0.0.jar:?] at org.apache.iceberg.parquet.VectorizedParquetReader$FileIterator.next(VectorizedParquetReader.java:141) ~[iceberg-spark-runtime-3.3_2.12-1.0.0.jar:?] at org.apache.iceberg.spark.source.BaseDataReader.next(BaseDataReader.java:107) ~[iceberg-spark-runtime-3.3_2.12-1.0.0.jar:?] at org.apache.spark.sql.execution.datasources.v2.PartitionIterator.hasNext(DataSourceRDD.scala:119) ~[spark-sql_2.12-3.3.0-amzn-1.j ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org