Hi,
I am creating a multi-node Hadoop cluster for a personal project, and I would
like to use the official docker image
(apache/hadoop<https://hub.docker.com/r/apache/hadoop>).
However, looking at the official docker image documentation and the
docker-compose file I have seen the following environment variable:
environment:
ENSURE_NAMENODE_DIR: "/tmp/hadoop-root/dfs/name"
I would like to know if it is safe to create the namenode directory in the /tmp
folder since this kind of folder is neither secure nor data persistent. Thus, I
would like to understand which path is the best practice for this. Moreover,
which are other environment variables I could use of.
Thanks a lot, in advance.
Kind regards,
Nikos Spanos
M.Sc Business Analytics & Big Data| Athens University of Economics & Business
Phone Number: +306982310494
Linkedin profile <http://linkedin.com/in/spanosnikos>