Karen,
Thanks for the clarifications.
I'm concerned about the software, but it looks like we'll install Hadoop
On Demand anyway, as someone has already promised a user we'd do it. If
there were serious pitfalls, I might be able to slow it down some, but
mere inefficiency isn't sufficient grounds... we have users writing MPI
code who daily redefine "inefficient"!
Again, thanks!
gerry
Karen Shaeffer wrote:
On Fri, Dec 26, 2008 at 05:16:04PM -0600, Gerry Creager wrote:
The subject line says it all: Hadoop: Anyone got any experience with it
on clusters (OK, so Google does, but that really wasn't the question,
was it?).
Hi,
Google doesn't use Hadoop. Google published some papers on their
distributed computing environment that they invented. Then some
Java programmers implemented Hadoop after reading the papers
published by Google. Hadoop is grossly inefficient, as it is written
in Java. But it does work. Folks who use Hadoop include Yahoo. I
believe Amazon uses it as well. If you care about CPU cycles, then
you really don't want to get involved with Hadoop.
Karen
--
Gerry Creager -- gerry.crea...@tamu.edu
Texas Mesonet -- AATLT, Texas A&M University
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf