Re: [I] Spark rewrite Files Action OOM [iceberg]

via GitHub Sun, 07 Apr 2024 10:29:41 -0700


nk1506 commented on issue #10054:
URL: https://github.com/apache/iceberg/issues/10054#issuecomment-2041538414


   > @nk1506 Echoing Russell's comments, how many small files are there in your 
OOM case? How much memory do you set up?
   
   I didn't use spark-engine for compaction. I was using Java Client API. My 
queries might distract from the original problem. Although my requirement is to 
compact very large datasets(say 10K datafiles) with single commit. Using 
[RewriteFiles](https://github.com/apache/iceberg/blob/main/api/src/main/java/org/apache/iceberg/RewriteFiles.java#L171)
 always might cause OOM. So I am looking something which can help to manage 
manifestFiles more intelligently. I think I will start different thread to 
discuss the other problem. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Re: [I] Spark rewrite Files Action OOM [iceberg]

Reply via email to