OOM errors do become annoying, so I added the cache-clearing steps from
https://www.tecmint.com/clear-ram-memory-cache-buffer-and-swap-space-on-linux/
to the restart script as well. But as Shawn has said, the ones deployed
have never gone down, granted each is given 31 GB of heap space on a
server with 200+ GB of RAM. Sometimes it's just better to throw
money/hardware at it.

On Tue, Jun 21, 2022 at 9:34 AM Markus Jelsma <[email protected]> wrote:

> We have never seen one of our Solr instances go down due to running out of
> disk space. Instead, ours sometimes go down due to OutOfMemory when there
> is a sudden extreme spike in activity. If that happens, Solr gets stopped.
>
> Our daemons are all controlled by systemd, so they are restarted
> automatically and usually join the cluster again without issues. For us,
> the automatic restarts mean less work, but only in the case of these
> OutOfMemory errors. As said, this does not work for disk space issues.
>
> On Tue, Jun 21, 2022 at 14:40, Dave <[email protected]> wrote:
>
> > In my experience, if Solr goes down it's because it ran out of disk
> > space, so if you just bring it back up automatically it will go down
> > again. There are simple bash scripts you can write for standalone Solr
> > that will do what you want; you just need to be sure they destroy any
> > child Solr processes that may still be lingering around.
> >
> > > On Jun 21, 2022, at 8:16 AM, Poorna Murali <[email protected]>
> > > wrote:
> > >
> > > Hi All,
> > >
> > > If we have an alerting mechanism that reports when a Solr node is
> > > down, is it possible to configure the alert with some action that
> > > will restart that Solr node so it recovers automatically?
> > > Please advise.
> > >
> > > Thanks,
> > > Poorna
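
For anyone who wants a starting point, the restart-script idea from this thread could look roughly like the sketch below. It is not anyone's actual script. The install path, data directory, port, free-space threshold, the DROP_CACHES switch, the --run argument, and the pkill pattern are all assumptions for illustration and will need adjusting for a real install. It refuses to restart when the disk is nearly full (since Solr would just go down again), kills any lingering Solr process, optionally drops the page cache, and then starts Solr.

```shell
#!/usr/bin/env bash
# Sketch of a restart helper for a standalone Solr node.
# All paths, the port, the threshold, DROP_CACHES and --run are assumptions.

SOLR_BIN="${SOLR_BIN:-/opt/solr/bin/solr}"
SOLR_DATA_DIR="${SOLR_DATA_DIR:-/var/solr}"
SOLR_PORT="${SOLR_PORT:-8983}"
MIN_FREE_PCT="${MIN_FREE_PCT:-10}"

# Print the percentage of free space on the filesystem holding path $1.
free_pct() {
  local used
  used=$(df -P "$1" | awk 'NR==2 { gsub("%", "", $5); print $5 }')
  echo $((100 - used))
}

# Succeed only if path $1 has at least $2 percent free space.
disk_ok() {
  [ "$(free_pct "$1")" -ge "$2" ]
}

restart_solr() {
  # A restart does not help when the disk is full, so bail out early.
  if ! disk_ok "$SOLR_DATA_DIR" "$MIN_FREE_PCT"; then
    echo "low disk space on $SOLR_DATA_DIR, not restarting" >&2
    return 1
  fi

  # Ask Solr to stop cleanly; ignore failure if it is already down.
  "$SOLR_BIN" stop -p "$SOLR_PORT" || true

  # Kill any lingering Solr JVM. Assumes its command line contains
  # jetty.port=<port>; adjust the pattern for your install.
  pkill -9 -f "jetty\.port=${SOLR_PORT}" || true

  # Optional, root only: flush dirty pages, then drop the page cache.
  if [ "${DROP_CACHES:-0}" = "1" ] && [ "$(id -u)" -eq 0 ]; then
    sync
    echo 3 > /proc/sys/vm/drop_caches
  fi

  "$SOLR_BIN" start -p "$SOLR_PORT"
}

# Only act when explicitly asked to, so the file is safe to source.
if [ "${1:-}" = "--run" ]; then
  restart_solr
fi
```

An alert action could then call the script with --run. If Solr is already managed by systemd, as Markus describes, letting systemd do the restart is probably simpler than a custom script.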
