Re: [PR] Flink: Custom partitioner for bucket partitions [iceberg]

via GitHub Sun, 15 Oct 2023 18:49:42 -0700


chenwyi2 commented on PR #7161:
URL: https://github.com/apache/iceberg/pull/7161#issuecomment-1763604926


   yes, I am creating very fine grained partitions, because i want to query and 
comput some business metrics between minutes ss fast as possible. As for bucket 
number, i use a fomula QPS * 500B/条 / bucket number / 1024 /1024  = 10M/秒(The 
write-in traffic of one bucket), because i found that the write-in traffic of 
one bucket is large, the writer can  be OOM or backpressure, i set 10M/秒 in 
each bucket.
   In this pr hdfs can be influenced by many files, writting to files can be 
slow then data latency will be relatively large.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Re: [PR] Flink: Custom partitioner for bucket partitions [iceberg]

Reply via email to