Hi,

For this amount of nodes, I'd go with automation tools like
Ansible[1]/Puppet[2]/Rex[3]. They can install necessary packages, setup
/etc/hosts and make per-node settings.

Ansibles has a nice playbook
(https://github.com/analytically/hadoop-ansible) you can start with and
Puppet isn't short either (https://forge.puppet.com/tags/hadoop).

Best,
Sanel

[1] https://ansible.com
[2] https://puppet.com
[3] https://rexify.org

"Zaki SEc." <[email protected]> writes:
> [I am sorry in case this mail is sent twice, it was not intentional]
>
> Hi!
>
> I'm fairly new to Hadoop, but I've been browsing the documentation and
> 'how-to'-s for some time now.
>
> My question would be as follows; How can one setup a cluster, where the
> nodes aren't static?
> What I mean is, I want to be able to run a cluster, say, 20 machines, where
> each of the nodes have Hadoop installed, and they 'recognize' each other -
> saving me from having to manually set their hostnames and configure their
> '/etc/hosts' file.
>
> I did look into Apache Ambari, hoping that it would give me an easy
> solution to the above problem, but it does not support Ubuntu 16.04 which I
> have to work with, and it failed to build for various reasons.
> I have also looked into Cloudera's CDH distribution, (the manual
> installation) but that has the same problem - it asks me to manually
> configure these settings for each node.
>
> It seemed to me, that "Rack Awareness" could potentially solve my problem,
> but after some reading, I had to realize that it's for a different thing
> entirely.
> So now it looks like I'm out of options.
>
> Lately, I was wondering about writing an external script, that would update
> the settings for each of the nodes automatically, based on one central
> 'list', hosted on, for ex. the NameNode. While this isn't nearly on the
> level of a real dynamic setup, it would make my job significantly easier.
>
> Thanks in advance,
> Zaki

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to