Fokko commented on code in PR #922:
URL: https://github.com/apache/iceberg-python/pull/922#discussion_r1677113136


##########
pyiceberg/catalog/glue.py:
##########
@@ -117,6 +124,12 @@
 ICEBERG_FIELD_OPTIONAL = "iceberg.field.optional"
 ICEBERG_FIELD_CURRENT = "iceberg.field.current"
 
+GLUE_PROFILE_NAME = "glue.profile-name"
+GLUE_REGION = "glue.region"
+GLUE_ACCESS_KEY_ID = "glue.access-key-id"

Review Comment:
   How common is it to have a separate `access-key-id` for glue and the table 
itself? The S3FileIO would not pick up the `glue.access-key-id`:
   
   
https://github.com/apache/iceberg-python/blob/3f44dfe711e96beda6aa8622cf5b0baffa6eb0f2/pyiceberg/io/pyarrow.py#L349
   
   This way you would need to set both `glue.access-key-id` (which is used for 
authenticating against Glue), and then you need to set `s3.access-key-id` to 
ensure that it can access the metadata on the bucket. Maybe good to highlight 
this on the docs, WDYT?
   
   I'm not an AWS expert, but my gut feeling is that normally people rely on 
`AWS_ACCESS_KEY_ID` to be picked up for both Glue and S3, but I prefer to be 
able to put this in the `~/.pyiceberg.yaml` as well.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to