javrasya opened a new issue, #9444:
URL: https://github.com/apache/iceberg/issues/9444

   ### Apache Iceberg version
   
   1.4.2 (latest release)
   
   ### Query engine
   
   Flink
   
   ### Please describe the bug 🐞
   
   I am using Flink version 1.15 and trying to consume from an Iceberg table 
registered on Glue catalog and located in S3. My flink application fails 
roughly after 10 minutes with the following error;
   
   
   I tried setting the following settings with my catalog loader but no luck;
   
   ```
   http-client.apache.expect-continue-enabled
   http-client.apache.tcp-keep-alive-enabled
   http-client.apache.socket-timeout-ms
   http-client.apache.connection-timeout-ms
   ```
   
   Can someone help me to fix this problem?
   
   Thanks in advance. 
   
   
   ```java.lang.RuntimeException: One or more fetchers have encountered 
exception
        at 
org.apache.flink.connector.base.source.reader.fetcher.SplitFetcherManager.checkErrors(SplitFetcherManager.java:225)
        at 
org.apache.flink.connector.base.source.reader.SourceReaderBase.getNextFetch(SourceReaderBase.java:169)
        at 
org.apache.flink.connector.base.source.reader.SourceReaderBase.pollNext(SourceReaderBase.java:130)
        at 
org.apache.flink.connector.base.source.reader.SourceReaderBase.pollNext(SourceReaderBase.java:156)
        at 
org.apache.flink.connector.base.source.hybrid.HybridSourceReader.pollNext(HybridSourceReader.java:85)
        at 
org.apache.flink.streaming.api.operators.SourceOperator.emitNext(SourceOperator.java:385)
        at 
org.apache.flink.streaming.runtime.io.StreamTaskSourceInput.emitNext(StreamTaskSourceInput.java:68)
        at 
org.apache.flink.streaming.runtime.io.StreamOneInputProcessor.processInput(StreamOneInputProcessor.java:65)
        at 
org.apache.flink.streaming.runtime.tasks.StreamTask.processInput(StreamTask.java:519)
        at 
org.apache.flink.streaming.runtime.tasks.mailbox.MailboxProcessor.runMailboxLoop(MailboxProcessor.java:203)
        at 
org.apache.flink.streaming.runtime.tasks.StreamTask.runMailboxLoop(StreamTask.java:804)
        at 
org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:753)
        at 
org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:953)
        at 
org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:932)
        at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:746)
        at org.apache.flink.runtime.taskmanager.Task.run(Task.java:568)
        at java.base/java.lang.Thread.run(Thread.java:829)
   Caused by: java.lang.RuntimeException: SplitFetcher thread 11 received 
unexpected exception while polling the records
        at 
org.apache.flink.connector.base.source.reader.fetcher.SplitFetcher.runOnce(SplitFetcher.java:150)
        at 
org.apache.flink.connector.base.source.reader.fetcher.SplitFetcher.run(SplitFetcher.java:105)
        at 
java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
        at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
        at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
        at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
        ... 1 more
   Caused by: org.apache.iceberg.exceptions.RuntimeIOException: Failed to find 
sync past position 0
        at 
org.apache.iceberg.avro.AvroIterable$AvroRangeIterator.<init>(AvroIterable.java:117)
        at org.apache.iceberg.avro.AvroIterable.iterator(AvroIterable.java:83)
        at 
org.apache.iceberg.io.CloseableIterable.lambda$filter$0(CloseableIterable.java:109)
        at 
org.apache.iceberg.io.CloseableIterable$2.iterator(CloseableIterable.java:72)
        at 
org.apache.iceberg.io.CloseableIterable$2.iterator(CloseableIterable.java:64)
        at org.apache.iceberg.util.Filter.lambda$filter$0(Filter.java:34)
        at 
org.apache.iceberg.io.CloseableIterable$2.iterator(CloseableIterable.java:72)
        at 
org.apache.iceberg.flink.source.RowDataFileScanTaskReader.open(RowDataFileScanTaskReader.java:110)
        at 
org.apache.iceberg.flink.source.DataIterator.openTaskIterator(DataIterator.java:139)
        at 
org.apache.iceberg.flink.source.DataIterator.updateCurrentIterator(DataIterator.java:129)
        at 
org.apache.iceberg.flink.source.DataIterator.hasNext(DataIterator.java:109)
        at 
org.apache.iceberg.flink.source.reader.ArrayPoolDataIteratorBatcher$ArrayPoolBatchIterator.hasNext(ArrayPoolDataIteratorBatcher.java:82)
        at 
org.apache.iceberg.flink.source.reader.IcebergSourceSplitReader.fetch(IcebergSourceSplitReader.java:82)
        at 
org.apache.flink.connector.base.source.reader.fetcher.FetchTask.run(FetchTask.java:58)
        at 
org.apache.flink.connector.base.source.reader.fetcher.SplitFetcher.runOnce(SplitFetcher.java:142)
        ... 6 more
   Caused by: java.net.SocketException: Connection reset
        at java.base/java.net.SocketInputStream.read(SocketInputStream.java:186)
        at java.base/java.net.SocketInputStream.read(SocketInputStream.java:140)
        at 
java.base/sun.security.ssl.SSLSocketInputRecord.read(SSLSocketInputRecord.java:484)
        at 
java.base/sun.security.ssl.SSLSocketInputRecord.readFully(SSLSocketInputRecord.java:467)
        at 
java.base/sun.security.ssl.SSLSocketInputRecord.decodeInputRecord(SSLSocketInputRecord.java:243)
        at 
java.base/sun.security.ssl.SSLSocketInputRecord.decode(SSLSocketInputRecord.java:181)
        at java.base/sun.security.ssl.SSLTransport.decode(SSLTransport.java:111)
        at 
java.base/sun.security.ssl.SSLSocketImpl.decode(SSLSocketImpl.java:1510)
        at 
java.base/sun.security.ssl.SSLSocketImpl.readApplicationRecord(SSLSocketImpl.java:1477)
        at 
java.base/sun.security.ssl.SSLSocketImpl$AppInputStream.read(SSLSocketImpl.java:1066)
        at 
org.apache.http.impl.io.SessionInputBufferImpl.streamRead(SessionInputBufferImpl.java:137)
        at 
org.apache.http.impl.io.SessionInputBufferImpl.fillBuffer(SessionInputBufferImpl.java:153)
        at 
org.apache.http.impl.io.SessionInputBufferImpl.read(SessionInputBufferImpl.java:205)
        at 
org.apache.http.impl.io.ContentLengthInputStream.read(ContentLengthInputStream.java:176)
        at 
org.apache.http.conn.EofSensorInputStream.read(EofSensorInputStream.java:135)
        at java.base/java.io.FilterInputStream.read(FilterInputStream.java:133)
        at 
software.amazon.awssdk.services.s3.checksums.ChecksumValidatingInputStream.read(ChecksumValidatingInputStream.java:112)
        at java.base/java.io.FilterInputStream.read(FilterInputStream.java:133)
        at 
software.amazon.awssdk.core.io.SdkFilterInputStream.read(SdkFilterInputStream.java:66)
        at org.apache.iceberg.aws.s3.S3InputStream.read(S3InputStream.java:109)
        at 
org.apache.iceberg.avro.AvroIO$AvroInputStreamAdapter.read(AvroIO.java:117)
        at 
org.apache.avro.file.DataFileReader$SeekableInputStream.read(DataFileReader.java:296)
        at 
org.apache.avro.file.DataFileReader$SeekableInputStream.read(DataFileReader.java:301)
        at 
org.apache.avro.io.BinaryDecoder$InputStreamByteSource.read(BinaryDecoder.java:912)
        at org.apache.avro.file.DataFileReader.sync(DataFileReader.java:201)
        at 
org.apache.iceberg.avro.AvroIterable$AvroRangeIterator.<init>(AvroIterable.java:115)
        ... 20 more
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to