Re: [Beowulf] Singularity 1.0 is out

2016-04-25 Thread Walid
Hi Jeff, So can one build an app in Centos 6, and run it in Cray XC40? would it be an alternative to shifter, and docker? I do not have access to Cray system yet, however this would solve one issue for our app developers. i will go read more about it On 17 April 2016 at 16:53, Jeffrey Layton wro

[Beowulf] ELK query "Elasticsearch, logstash and Kibana"

2014-08-07 Thread Walid
side of systems, however i see huge potential for people who can not afford to have splunk, I am thinking scheduler, interconnect, GPFS, provisioning, system logs, job tractability, metrics from accounting file, profiling, better failure management and visualisation. kind regards Walid

Re: [Beowulf] job scheduler and health monitoring system

2014-01-11 Thread Walid
UGE is used in over thousands of nodes, health checks are done via load sensors, a SGE/UGE feature. however i am not aware of any public repo for shared health checks. as for overheat, in one cluster it was done at the bios/firmware level by asking the vendor for certain thresholds to shut the mach

[Beowulf] Configuration management tools/strategy

2013-01-06 Thread Walid
, Saltstack, ansible, and blueprint. there might other products that we need to evluate in partnership such as Foreman, spacewalk, ..etc. I would like to hear from you if you did evaluate such tools, or using one, or have a different strategy in keeping and maintaining configurations. Thank you, Walid

Re: [Beowulf] Any beowulfers attending SC12?

2012-11-10 Thread Walid
Hi, I'll be at SC12 as well, and hopefully at Beobash. it is my first time to SC, and hopefully my first to Beobash as well if i can attend. regards Walid On 8 November 2012 06:59, Lawrence Stewart wrote: > I'll be there, with either my Serissa or Quanta Research Cambridge hats o

Re: [Beowulf] Open Grid Scheduler (SGE)

2011-01-24 Thread Walid
Doug, That would be great, as far as i know Oracle have not commented on this turn of events clearly yet. kind regards Walid On 24 January 2011 16:23, Douglas Eadline wrote: > As far as I understand, there will be one Oracle Grid Engine (commercial) > There are two open versions a

Re: [Beowulf] Open Grid Scheduler (SGE)

2011-01-23 Thread Walid
Doug, I am not sure any more which is which? we will have two commercial Grid Engines, and two Open source grid engines. will they all be in sync, code versioning, patching, features, or they will be complete forks each one will have its own roadmap? kind regards Walid On 20 January 2011 19:19

Re: [Beowulf] Kernel action relevant to us

2010-08-13 Thread Walid
Greg, do we know if that have made it to any Linux Kernel? kind regards Walid On 17 December 2009 05:05, Greg Lindahl wrote: > The following patch, not yet accepted into the kernel, should allow > local TCP connections to start up faster, while remote ones keep the > same behavio

[Beowulf] HPC/mpi courses

2010-01-16 Thread Walid
Dear All, do you know of any official courses run in Europe, or Asia covering HPC system, or development. mpi or new distributed memory paradigms are welcome. kind regards Walid ___ Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin

[Beowulf] high %system utiliziation on infiniband nodes

2008-07-26 Thread Walid
0.00 0.00 0.00 now i get the same behaviour on RHEL5.0/5.1 and RHEL4.6, using Infiniband or ethernet, so is this normal, to me it does not, or at least i have never seen such behaviour before? the node is a DELL PE1950 regards Walid ___ Beowulf

Re: [Beowulf] RHEL5 network throughput/scalability

2008-06-14 Thread Walid
2008/6/14 Perry E. Metzger <[EMAIL PROTECTED]>: > > A number of these seem rather odd, or unrelated to performance. > > Walid <[EMAIL PROTECTED]> writes: > > It is lame, however i managed to get the following kernel paramter to > scale > > well in

Re: [Beowulf] RHEL5 network throughput/scalability

2008-06-13 Thread Walid
.icmp_echo_ignore_broadcasts = 0 net.ipv4.tcp_max_orphans = 262144 net.core.netdev_max_backlog = 2000 regards Walid 2008/6/13 Walid <[EMAIL PROTECTED]>: > 2008/6/13 Jason Clinton <[EMAIL PROTECTED]>: > >> >> We've seen fairly erratic behavior induced by newer drivers for NVidia &

Re: [Beowulf] RHEL5 network throughput/scalability

2008-06-13 Thread Walid
down where it did perform well, I have saved the sysctl and will check what parameters have made the difference. regards Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

[Beowulf] RHEL5 network throughput/scalability

2008-06-12 Thread Walid
from around 500+MBps to around 300, and again from RHEL4 the behaviour is different. any pointers? TIA Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman

Re: [Beowulf] size of swap partition

2008-06-10 Thread Walid
Hi, For an 8GB dual socket quad core node, choosing in the kick start file --recommended instead of specifying size RHEL5 allocates 1GB of memory. our developers say that they should not swap as this will cause an overhead, and they try to avoid it as much as possible regards Walid On 10/06

Re: [Beowulf] Configuration change monitoring

2007-08-30 Thread Walid
d on other ways of distributing configurations files safely, and in a timely manner. regards Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

Fwd: [Beowulf] Configuration change monitoring

2007-08-30 Thread Walid
-- Forwarded message -- From: Walid <[EMAIL PROTECTED]> Date: Aug 30, 2007 9:22 PM Subject: Re: [Beowulf] Configuration change monitoring To: "Robert G. Brown" <[EMAIL PROTECTED]> Hi Robert On 8/30/07, Robert G. Brown <[EMAIL PROTECTED]> wrote: >

Re: [Beowulf] Configuration change monitoring

2007-08-30 Thread Walid
ers), as a result most of the team that manages the system are junior admins, and the problem with cfengine it does require a steep learning curve, may be i need to revisit it or one of its alternatives. regards Walid ___ Beowulf mailing list, Beowulf@beow

Re: [Beowulf] Configuration change monitoring

2007-08-30 Thread Walid
d log monitoring probably using SEC, I could advise a solution, and the alerting probably could be done using SEC/GroundWorks/Zenoss regards Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) vi

[Beowulf] Configuration change monitoring

2007-08-29 Thread Walid
Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] IB switches: managed or not?

2007-03-07 Thread Walid
e also some scripts that helps in configuration of the fabric, and cluster regards Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] network filesystem

2007-03-05 Thread Walid
v4 brings to the table standard client implementation. unfortunately Red Hat recommends RHEL5 which should be out soon now for NFSv4 Walid. ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://w

Re: [Beowulf] failure rates

2007-02-05 Thread Walid
thesis? will the question of how the reduce or manage the failures be part of it? regards Walid. ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] non-proprietary IPMI card?

2006-12-02 Thread Walid
s Full, you can not manage it any more, and apperantly it does get filled up during crashes. regards Walid. ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] non-proprietary IPMI card?

2006-11-28 Thread Walid
to manage nodes remotley when the node has a kernel panic for example. and we had other intermitent problems that we do not yet know the cause exatly.i am really interested to know what other implications that bridging/sharing will imply regards Walid.

[Beowulf] Myrinet metrics monitoring via ganglia

2006-07-12 Thread Walid
netrcv_cnt for this? any one using snmp on myrinte switchs would like to share their experince TIA regards Walid ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman

[Beowulf] 512 nodes Myrinet cluster Challanges

2006-04-27 Thread Walid
intensve, and requires large memory models. any hints, pointers will be apperciated TIA Walid. ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

[Beowulf] Fwd: NIS limitations question

2006-02-06 Thread Walid
Dear All,I belive i have seen on this maling list*, and other internet fourms** some limitation of NIS, but i have failed to find a documented limiation from SUN, or from the various linux distrubutions, did any one try to research the scalability of NIS servers?  The reason i am asking on a 256 n