JunchengMa commented on issue #6679:
URL: https://github.com/apache/iceberg/issues/6679#issuecomment-1416599385

   > 
   
   +1 on @dramaticlly 's comment, changing the write distribution mode affects 
Spark job performance (causes heavy shuffle) when using Spark SQL like
   ```
   DELETE FROM db_name.tbl_name WHERE date < '20220801'
   ```
   or
   ```
   UPDATE db_name.tlb_name SET col_a = NULL WHERE date <= '20220801'
   ```
   setting `write.delete.distribution-mode`=`none` and 
`write.update.distribution-mode`='none' at table properties would reduce 
shuffle, but could affect other normal jobs writing to the same table.
   So having an option for specifying the write distribution mode would be 
ideal.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to