Neuw84 commented on code in PR #11383:
URL: https://github.com/apache/iceberg/pull/11383#discussion_r1824255200


##########
docs/docs/aws.md:
##########
@@ -669,6 +669,13 @@ Users can use catalog properties to override the defaults. 
For example, to confi
 --conf spark.sql.catalog.my_catalog.http-client.apache.max-connections=5
 ```
 
+**Note that for workloads with exceptionally high throughput against tables 
that S3 where you will likely to increase Retries, you will also like to 
increase the number of connections for the HTTP client**
+
+```shell
+spark.sql.catalog.my_catalog.http-client.apache.max-connections=200

Review Comment:
   It is a thing of AWS SDK and Spark ( not specifically to EMR). If you use 
Spark on your laptop writing to S3 and you are on this high throughput write 
scenario you will likely like to tune the parameter. 
   
   I mean, we are speaking about a AWS docs in this page ( the parameter is 
quite specific to the S3 client of the AWS SDK). 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to