RussellSpitzer opened a new issue, #9458:
URL: https://github.com/apache/iceberg/issues/9458

   ### Feature Request / Improvement
   
   There are several places in our code currently where a failure while reading 
a file will throw an exception but the exception will not contain any 
information related to which file was being read during the failure. The Avro 
reader is an example of this.
   
   When planning a table scan with a corrupted manifest the user will end up 
with an exception like
   ```java
   org.apache.iceberg.exceptions.RuntimeIOException: Failed to read next record 
at 
   
org.apache.iceberg.avro.AvroIterable$AvroReuseIterator.next(AvroIterable.java:204)
 at
   org.apache.iceberg.io.CloseableIterable$7$1.next(CloseableIterable.java:202) 
at
   org.apache.iceberg.io.FilterIterator.advance(FilterIterator.java:65) at
   org.apache.iceberg.io.FilterIterator.hasNext(FilterIterator.java:49) at
   
org.apache.iceberg.io.CloseableIterable$7$1.hasNext(CloseableIterable.java:197) 
at
   org.apache.iceberg.io.CloseableIterator$2.hasNext(CloseableIterator.java:72) 
at
   org.apache.iceberg.io.ClosingIterator.hasNext(ClosingIterator.java:39) at
   
scala.collection.convert.JavaCollectionWrappers$JIteratorWrapper.hasNext(JavaCollectionWrappers.scala:37)
 at
   scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:593) at
   scala.collection.Iterator$$anon$9.hasNext(Iterator.scala:576) at
   
org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage6.processNext(Unknown
 Source) at 
   ``` 
   
   
   This is obviously not very useful since we do not know which one it is. I 
think we should go into our AvroIterable (and other file format readers) and 
make sure that when they include the file path in the error.
   
   ### Query engine
   
   None


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to