Re: [Beowulf] Large amounts of data to store and process

2019-03-03 Thread Jonathan Aquilina
These would be numerical data such as integers or floating point numbers. -Original Message- From: Tony Brian Albers Sent: 04 March 2019 08:04 To: beowulf@beowulf.org; Jonathan Aquilina Subject: Re: [Beowulf] Large amounts of data to store and process Hi Jonathan, From my limited know

Re: [Beowulf] Large amounts of data to store and process

2019-03-03 Thread Tony Brian Albers
Hi Jonathan, From my limited knowledge of the technologies, I would say that HBase with file pointers to the files placed on HDFS would suit you well. But if the files are log files, consider some tools that are suited for analyzing those like Kibana. /tony On Mon, 2019-03-04 at 06:55 +, J

Re: [Beowulf] Large amounts of data to store and process

2019-03-03 Thread Jonathan Aquilina
Hi Tony, Sadly I cant go into much detail due to me being under an NDA. At this point with the prototype we have around 250gb of sample data but again this data is dependent on the type of air craft. Larger aircraft and longer flights will generate a lot more data as they have more sensors and

Re: [Beowulf] Large amounts of data to store and process

2019-03-03 Thread Tony Brian Albers
On Mon, 2019-03-04 at 06:38 +, Jonathan Aquilina wrote: > Good Morning all, > > I am working on a project that I sadly cant go into much detail but > there will be quite large amounts of data that will be ingested by > this system and would need to be efficiently returned as output to > the en

[Beowulf] Large amounts of data to store and process

2019-03-03 Thread Jonathan Aquilina
Good Morning all, I am working on a project that I sadly cant go into much detail but there will be quite large amounts of data that will be ingested by this system and would need to be efficiently returned as output to the end user in around 10 min or so. I am in discussions with another partn

Re: [Beowulf] 2 starting questions on how I should proceed for a correct first micro-cluster (2-nodes) building

2019-03-03 Thread Marco Ippolito
Thank you very much Greg,Douglas, John and Michael. You very kindly "overwhelmed" me, and I thank you for that, with hints of things I didn't know about. So my very next step will be to understand each of your hints, and I guess I will come back with some more practical questions about them. In the

Re: [Beowulf] 2 starting questions on how I should proceed for a correct first micro-cluster (2-nodes) building

2019-03-03 Thread Greg Keller
I Third OpenHPC, or at least the Warewulf underpinnings in it. http://warewulf.lbl.gov/ For "learning" the software stack you may consider beefing up your current node and running virtualized environment inside it? I use the community version of Proxmox (https://www.proxmox.com/en/downloads). On

Re: [Beowulf] 2 starting questions on how I should proceed for a correct first micro-cluster (2-nodes) building

2019-03-03 Thread Douglas Eadline
One thing to remember, a cluster is often defined by its environment and goals. 1. HPC type clusters, as discussed on this list, operate in a particular way (cluster nodes are provisioned in a reproducible way, job scheduler that provides dynamic resources to users, clusters are optimized for pro

Re: [Beowulf] 2 starting questions on how I should proceed for a correct first micro-cluster (2-nodes) building

2019-03-03 Thread Gerald Henriksen
On Sun, 3 Mar 2019 09:28:03 +, you wrote: >ps. If you are interested in parallelism... there is Julia. >https://docs.julialang.org/en/v1/manual/parallel-computing/index.html Also Chapel https://chapel-lang.org/ ___ Beowulf mailing list, Beowulf@beo

Re: [Beowulf] 2 starting questions on how I should proceed for a correct first micro-cluster (2-nodes) building

2019-03-03 Thread John Hearns via Beowulf
ps. If you are interested in parallelism... there is Julia. https://docs.julialang.org/en/v1/manual/parallel-computing/index.html I would also advise setting up just one server and install the latest CentOS. You can start with some tutorials on MPI - which is the current standard for parallelism.

Re: [Beowulf] 2 starting questions on how I should proceed for a correct first micro-cluster (2-nodes) building

2019-03-03 Thread John Hearns via Beowulf
I second OpenHPC. It is actively maintained and easy to set up. Regarding the hardware, have a look at Doug Eadlines Limulus clusters. I think they would be a good fit. Dougs site is excellent in general https://www.clustermonkey.net/ Also some people build Raspberry Pi clusters for learning. O