andygrove opened a new pull request, #1579: URL: https://github.com/apache/datafusion-ballista/pull/1579
## Summary - Add `cloud-storage.md` with a Python S3 quickstart guide covering object store registration, external table creation, Hive partition pitfalls, Kubernetes deployment, and common troubleshooting errors - Restructure Python docs into a `user-guide/python/` subdirectory (mirrors the existing `deployment/` pattern), with separate pages for quickstart, S3/cloud storage, and Jupyter notebooks - Narrow the cloud storage page title to "Querying S3 Data" (previously claimed S3, GCS, and Azure but only S3 was documented) - Clarify the two Python connection APIs (`BallistaSessionContext` vs `BallistaBuilder`) with a short explanation of when to use each - Fix code fences in REPL examples (`text` → `python`) ## Test plan - [ ] Verify docs build without Sphinx warnings (`cd docs && make html`) - [ ] Confirm all nav links resolve correctly in the rendered output - [ ] Verify `setup_test_cluster` is a real exported symbol in the Python bindings (used in `jupyter.md`) 🤖 Generated with [Claude Code](https://claude.com/claude-code) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
