paulpaul1076 commented on issue #9679:
URL: https://github.com/apache/iceberg/issues/9679#issuecomment-1943746314

   @nastra these are the logs from the driver that does compaction and fails 
with this content length exception, and from one of the executors:
   
   [logs.zip](https://github.com/apache/iceberg/files/14279784/logs.zip)
   
   Btw, this exception also started popping up when querying with Trino:
   
   ```
   Caused by: io.trino.spi.TrinoException: Error opening Iceberg split 
s3a://obs-zdp-warehouse-stage-mz/stage.db/test_simply_partitioned3/data/date=2024-02-14/00177-46924-33e1de6e-67aa-445d-89b2-a3baaa190813-00001.parquet
 (offset=4, length=6386727): Read 49152 tail bytes of file 
s3a://obs-zdp-warehouse-stage-mz/stage.db/test_simply_partitioned3/data/date=2024-02-14/00177-46924-33e1de6e-67aa-445d-89b2-a3baaa190813-00001.parquet
 failed: Incorrect file size (6386731) for file (end of stream not reached): 
s3a://obs-zdp-warehouse-stage-mz/stage.db/test_simply_partitioned3/data/date=2024-02-14/00177-46924-33e1de6e-67aa-445d-89b2-a3baaa190813-00001.parquet
       at 
io.trino.plugin.iceberg.IcebergPageSourceProvider.createParquetPageSource(IcebergPageSourceProvider.java:1073)
       at 
io.trino.plugin.iceberg.IcebergPageSourceProvider.createDataPageSource(IcebergPageSourceProvider.java:546)
       at 
io.trino.plugin.iceberg.IcebergPageSourceProvider.createPageSource(IcebergPageSourceProvider.java:333)
       at 
io.trino.plugin.iceberg.IcebergPageSourceProvider.createPageSource(IcebergPageSourceProvider.java:249)
       at 
io.trino.plugin.base.classloader.ClassLoaderSafeConnectorPageSourceProvider.createPageSource(ClassLoaderSafeConnectorPageSourceProvider.java:48)
       at 
io.trino.split.PageSourceManager.createPageSource(PageSourceManager.java:61)
       at 
io.trino.operator.TableScanOperator.getOutput(TableScanOperator.java:296)
       at io.trino.operator.Driver.processInternal(Driver.java:395)
       at io.trino.operator.Driver.lambda$process$8(Driver.java:298)
       at io.trino.operator.Driver.tryWithLock(Driver.java:694)
       at io.trino.operator.Driver.process(Driver.java:290)
       at io.trino.operator.Driver.processForDuration(Driver.java:261)
       at 
io.trino.execution.SqlTaskExecution$DriverSplitRunner.processFor(SqlTaskExecution.java:887)
       at 
io.trino.execution.executor.timesharing.PrioritizedSplitRunner.process(PrioritizedSplitRunner.java:187)
       at 
io.trino.execution.executor.timesharing.TimeSharingTaskExecutor$TaskRunner.run(TimeSharingTaskExecutor.java:565)
       at io.trino.$gen.Trino_428____20240214_031724_2.run(Unknown Source)
       at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
       at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
       at java.base/java.lang.Thread.run(Thread.java:840)
   ```
   
   It started happening after I regularly started running compaction. Also, 
been running compaction in Scala DSL (not Spark SQL)for a few days and it 
didn't fail a single time... So weird that Spark SQL and Trino fail.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to