I am often asked to take a look at one to many Solr log files that are
hundreds of megabytes to gigabytes in size. "Peaking" at this amount of
logs is a bit time consuming. Anybody that does this often enough has to
build a log parsing tool eventually. One off greps can only get you so far.
The last log reader I started putting together was after a hurricane a year
or two back while I was without power for a week. I have a 6 core machine
and a lot of RAM and I wanted to be able to blow through hundreds of
megabytes of log files in a few seconds (why not). So I started this multi
threaded single file Solr log reader that uses MappedByteBuffers. As I have
had to do a little debugging here or there over the months I have added a
bit to the program. It's my current go to tool when faced with a dozen
gigabyte log files. And it's pretty darn fast if you have some cores to
throw at it.

At this point, to avoid losing the program and someday starting yet again,
I've shared it on GitHub and invite anyone else looking to have a really
good standard log analyzer available to help complete it or offer logs for
testing. It's still early. It's still basically in personal, hack project
phase. But as I have further needs I will continue to add to it and improve
it. It already does a bit more than the current sample output in the README.

My goal is to make it generic enough to read a variety of formats – either
by being flexible internally or through user configuration. With a little
effort, there is a lot of great information and summarization that can be
pulled out of Solr logs.

https://github.com/markrmiller/SolrLogReader


-- 
- Mark

http://about.me/markrmiller

Reply via email to