kevinjqliu commented on issue #1019:
URL: 
https://github.com/apache/iceberg-python/issues/1019#issuecomment-2274369979

   > Ah I see, thanks! Using the standalone-metastore container did indeed work 
with pyiceberg.
   
   Awesome!
   
   > When I start the standalone metastore, I'm able to connect via pyiceberg. 
But I'm unable to `docker exec -it metastore-standalone beeline -u 
'jdbc:hive2://localhost:9083'` - is that expected behavior?
   
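   Yes, this is expected, because beeline is used to connect to HiveServer2, 
and the standalone metastore container does not start HiveServer2. Port 9083 
there is the Hive Metastore endpoint, not a HiveServer2 JDBC endpoint.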
   
   > Is that a documentation error, or perhaps I'm misunderstanding what 
embedded Metastore means in this context?
   
   There are a few different ways to launch a "Hive setup":
   1. "Launch the HiveServer2 with an embedded Metastore." This launches 
HiveServer2 and exposes its port. It also launches an "embedded Metastore" 
which, as the name suggests, is not exposed. The embedded Metastore is an 
instance of Hive Metastore.
   2. "Launch Standalone Metastore". This launches Hive Metastore as a 
standalone service and exposes its port, but the query engine part of Hive 
(HiveServer2) is not started.
   3. "Using HS2 with Standalone/Remote Metastore". This is a hybrid. It 
launches both HiveServer2 and Hive Metastore and exposes the ports of both.
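
   To see which of these setups is actually running, a quick port probe can 
help. This is only a rough sketch using the Python standard library. It 
assumes the Metastore listens on 9083 (the port used in this thread) and 
HiveServer2 on 10000 (its usual default, which your container may map 
differently). The names `port_is_open`, `ENDPOINTS` and `probe` are made up 
for this illustration.

```python
import socket

# Assumed ports: 9083 is the Metastore port from this thread; 10000 is the
# usual HiveServer2 default and may differ in your docker port mapping.
ENDPOINTS = {"Hive Metastore": 9083, "HiveServer2": 10000}


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused, timed out, or unreachable: treat as not exposed.
        return False


def probe(host: str = "localhost") -> dict:
    """Check each assumed endpoint and report whether it accepts connections."""
    return {name: port_is_open(host, port) for name, port in ENDPOINTS.items()}


if __name__ == "__main__":
    for name, is_open in probe().items():
        print(f"{name}: {'reachable' if is_open else 'not reachable'}")
```

   With option 2 you would expect only the Metastore to be reachable, and 
with option 3 both. An open port only shows that something is listening, not 
which service it is.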
   
   
   Depending on what you're doing, option 3 should give you both a HiveServer2 
which you can connect to using beeline, and a Hive Metastore which you can 
connect to with PyIceberg.
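
   For the PyIceberg side, here is a minimal sketch of pointing a Hive 
catalog at the Metastore. It assumes the Metastore is reachable at 
`thrift://localhost:9083` and that `pyiceberg` is installed with Hive 
support. The catalog name `default` and the helper names 
`hive_catalog_properties` and `connect` are placeholders, not anything from 
the docs.

```python
def hive_catalog_properties(host: str = "localhost", port: int = 9083) -> dict:
    """Build catalog properties for a Hive Metastore (host/port are assumptions)."""
    return {"type": "hive", "uri": f"thrift://{host}:{port}"}


def connect(name: str = "default", host: str = "localhost", port: int = 9083):
    """Load a PyIceberg catalog backed by the Hive Metastore."""
    # Imported lazily so the helper above can be used without pyiceberg installed.
    from pyiceberg.catalog import load_catalog

    return load_catalog(name, **hive_catalog_properties(host, port))


# Example usage (needs a running Metastore, e.g. option 2 or option 3):
#   catalog = connect()
#   print(catalog.list_namespaces())
```

   Note that this talks to the Metastore directly and never goes through 
HiveServer2, which is why it works with the standalone metastore container 
while beeline does not.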
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

