Re: [Beowulf] Lustre Upgrades

2018-07-25 Thread Jeff Johnson
On Wed, Jul 25, 2018 at 3:16 AM, Chris Samuel wrote: > > I'm not sure, but I know people who've recently (last month or two) had a world of pain running CephFS with multiple MDS's when it managed to get into a split brain situation (if my understanding of what happened is right). > > Split

Re: [Beowulf] Lustre Upgrades

2018-07-25 Thread Joe Landman
On 07/25/2018 04:36 PM, Prentice Bisbal wrote: Paging Dr. Joe Landman, paging Dr. Landman... My response was "I'd seen/helped build/benchmarked some very nice/fast CephFS based storage systems in $dayjob-1.  While it is a neat system, if you are focused on availability, scalability, and

[Beowulf] Systems Programmer / Administrator position at Rutgers University's Office of Advanced Research Computing

2018-07-25 Thread Eric Marshall
https://jobs.rutgers.edu/postings/70184 Rutgers, The State University of New Jersey, is seeking a Systems Programmer / Administrator for the Office of Advanced Research Computing. This position is under the direction of the Director of Advanced Computing Infrastructure (ACI). The Systems Progra

Re: [Beowulf] ServerlessHPC

2018-07-25 Thread Douglas Eadline
Wow, bitcoins! Sign me up -- Doug > All credit goes to Pim Schravendijk for coining a new term on Twitter today > https://twitter.com/rdwrt > https://twitter.com/rdwrt/status/1021761796498182144?s=03 > > We will all be doing it in six months time.

Re: [Beowulf] Lustre Upgrades

2018-07-25 Thread Prentice Bisbal
Paging Dr. Joe Landman, paging Dr. Landman... Prentice On 07/24/2018 10:19 PM, James Burton wrote: Does anyone have any experience with how BeeGFS compares to Lustre? We're looking at both of those for our next generation HPC storage system. Is CephFS a valid option for HPC now? Last time I

Re: [Beowulf] emergent behavior - correlation of job end times

2018-07-25 Thread Jonathan Engwall
Maybe as far as your datastore knows the job is already done. On July 24, 2018, at 12:19 PM, David Mathog wrote: Hi all, Thought some of you might find this interesting. Using the WGS (aka CA aka Celera) genome assembler there is a step which runs a large number (in this instance, 47634) of o

Re: [Beowulf] Lustre Upgrades

2018-07-25 Thread Chris Samuel
On Wednesday, 25 July 2018 12:19:43 PM AEST James Burton wrote: > Is CephFS a valid option for HPC now? Last time I played with CephFS it wasn't ready for prime time, but that was a few years ago. I'm not sure, but I know people who've recently (last month or two) had a world of pain running C