solr resource usage patterns

2018-04-28 Thread Nicola Gordon
Hello, Hoping someone has some insight on this. I need to understand resource usage patterns seen at solr cluster. Any insight/any info on what solr is doing would be much appreciated! Here's what I see: This pattern of CPU usage is seen throughout indexing - each of the lines is one of the

Re: multilingual list of stopwords

2007-10-18 Thread Gordon
Maria, It's perfectly reasonable to build a single list, sort it, and scan it for especially bad cases. See for example, http://members.unine.ch/jacques.savoy/clef/index.html for stopwords for several languages or check in some standard programming modules like: http://search.cpan.org/~fabpot/Ling