atifiu commented on issue #9330:
URL: https://github.com/apache/iceberg/issues/9330#issuecomment-1869464036
   @xuchang-66 @TechTinkerer42 I have tried to test this with 
`write.distribution-mode = none` and it can indeed help to improve the 
performance but can introduce the problem of small files. But we can control 
number of file created by repartitioning the input data on some column which 
can also distribute the data in each of the spark partition evenly and number 
of repartition should be a factor of number of cores. Each spark partition will 
create one file if we are writing to a single partition.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to