javrasya commented on issue #9410:
URL: https://github.com/apache/iceberg/issues/9410#issuecomment-1879135064

   @pvary I wasn't aware of `splitOpenFileCost`, thank you for sharing that. 
   
   The way I did it was to introduce my own `SplitAssignerFactory` and 
`SplitAssigner` and pass them down to the source. Here is the code for that custom 
`SplitAssigner`:
   
   https://gist.github.com/javrasya/98cfe90bd1a2585c56c4c3346a518477
   
   The thing is, even though I managed to reduce the number of tasks per 
split, each split was still big enough to raise the same error, so it did not solve 
my problem.
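   
   For readers who do not want to open the gist, the general idea of capping the number of tasks per split can be sketched roughly as below. This is not the code from the gist and not Iceberg's API: `SplitChunker`, `chunk`, and `maxTasksPerSplit` are hypothetical names, and a real assigner would work on Iceberg's split and task types rather than a generic list.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Iceberg or Flink.
// It caps how many tasks end up in one split by cutting a task list into chunks.
final class SplitChunker {
    private SplitChunker() {}

    /** Breaks tasks into consecutive groups of at most maxTasksPerSplit elements. */
    static <T> List<List<T>> chunk(List<T> tasks, int maxTasksPerSplit) {
        if (maxTasksPerSplit <= 0) {
            throw new IllegalArgumentException("maxTasksPerSplit must be positive");
        }
        List<List<T>> chunks = new ArrayList<>();
        for (int start = 0; start < tasks.size(); start += maxTasksPerSplit) {
            int end = Math.min(start + maxTasksPerSplit, tasks.size());
            // Copy the sub-range so each chunk is independent of the input list.
            chunks.add(new ArrayList<>(tasks.subList(start, end)));
        }
        return chunks;
    }
}
```

   Capping the count only bounds how many tasks a split carries. It does not shrink what each individual task carries, so it cannot help if a single task is already too large.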
   
   How do you limit the statistics? I didn't do anything apart from creating my 
table and ingesting data into it. Also, do you think a table with 14 columns 
is too wide? 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

