On 10/13/20 3:12 PM, Oddo Da wrote:
Jim, Peter: by "things have not changed in the tooling" I meant that it is
the same approach/paradigm as it was when I was in HPC back in the late
1990s/early 2000s. Even with books about Open MPI: you can go on their
mailing list and ask what to read, and you will be pointed to the same
stuff published 20+ years ago, with maybe one or two books that are
"fresher" than that (I did exactly that a few months ago, naively
thinking that things had changed ;-) ).
The approach is still the same - you have to write the code at the low
level and worry about everything. It would be nice if this were improved
and things were abstracted up and away a bit. The appearance of Spark,
for example, did exactly that for data science/machine learning/"big
data" - especially when you write it in Scala (functional programming).
It makes for all sorts of cleaner, more abstracted, more correct code,
where the framework worries about the underlying data/computation
locality, the communication between all the machinery, and so on, and
you are left to worry about the problem you are solving. I just feel
that in the HPC world we have not moved to this point yet, and I am
trying to understand why.
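To make that concrete, here is a minimal Spark-in-Scala sketch of the kind
of code I mean (the file names and columns are made up for illustration).
Nothing in it says which node holds which partition or how the shuffle
traffic moves; the framework decides that, and the same program runs on a
laptop or on a cluster:

import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

object StationMeans {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("station-means")
      .getOrCreate()

    // Hypothetical input: one row per observation, columns "station" and "temp".
    val obs = spark.read.parquet("observations.parquet")

    // The groupBy implies a shuffle across the cluster, but the code never
    // mentions ranks, messages, or data placement.
    val means = obs.groupBy("station")
      .agg(avg("temp").as("mean_temp"))

    means.write.parquet("station_means.parquet")
    spark.stop()
  }
}

Compare that with the equivalent MPI program, where the decomposition, the
sends/receives and the reduction are all yours to write - and to get wrong.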
I mean, let's say I was a data science researcher at a university and
all that was on offer was the traditional HPC cluster - what tooling
would I use to do my research? The whole world is doing something else,
but I am stuck worrying about the low-level details... or I need to ask
for a separate HDFS/Spark cluster? What if I want to stream data from
somewhere, as is commonly done in industry (solutions like Kafka etc.)?
My only options are to stand up a local cluster (which costs time,
money, and ongoing admin/maintenance) or to go to AWS or Azure and spend
taxpayer money to fill corporate coffers for what should already be a
solved problem, given the money that was already spent on all the
hardware at the university.
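For the streaming case, what I have in mind is roughly the following
sketch (hedged: the broker address and topic name are placeholders, and it
assumes the spark-sql-kafka connector is on the job's classpath):

import org.apache.spark.sql.SparkSession

object KafkaIngest {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("kafka-ingest").getOrCreate()

    // Subscribe to a Kafka topic; "broker:9092" and "sensor-readings" are placeholders.
    val stream = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker:9092")
      .option("subscribe", "sensor-readings")
      .load()

    // Kafka records arrive as key/value byte arrays; cast the value to text.
    val readings = stream.selectExpr("CAST(value AS STRING) AS reading")

    // Print each micro-batch to the console; a real job would write to storage.
    val query = readings.writeStream
      .format("console")
      .outputMode("append")
      .start()

    query.awaitTermination()
  }
}

That is the kind of pipeline that is routine in industry, but that a
traditional batch-scheduled cluster gives you no obvious way to host.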
BTW, Spark is just an example of how tooling/methodologies have improved
in industry in the domain of distributed computation. This is why I
thought that Julia may be one of those things that provides a different
(improved?) way of doing things, where both the climate modeling guys
and the data science guys can use the same HPC hardware...
A number of countries have national HPC infrastructures. Small and
moderate allocations on XSEDE or similar allow people to get some
experience with HPC without their institution investing in significant
computational resources. The problem is usually that knowledge of how to
use these resources is then scarce at an institution without any HPC
resources of its own.
A typical university cluster can run data science workloads with Spark,
Hadoop, etc.; it just requires the admins to make this possible. Systems
like Comet are built for this kind of work:
https://portal.xsede.org/sdsc-comet
Once jobs start using tens to hundreds of thousands of core hours,
taxpayer money (and probably also the environment) is saved by writing
in a low-level language.
A small number of countries design entirely new systems and train their
students to write/port software for them - much as happened with
bleeding-edge systems 20 years ago :)
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit
https://beowulf.org/cgi-bin/mailman/listinfo/beowulf