Hello folks, I just wanted to share a PoC of running HDFS on Kubernetes that I hacked on over the summer.
https://github.com/aespinosa/cloud-hadoop The above repo introduces a "hdfs kube_namenode" command where the namenode formats itself or runs "-bootstrapStandby" depending on the state of provisioning an HDFS cluster. The state information is stored inside Kubernetes as metadata annotations to the manifest .yml of running HDFS. With this, it is simpler to bring up HA sandbox HDFS clusters without the need for externally orchestrating namenode and zkfc formatting. Hope others find this useful. Regards, Allan --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
