: <fieldtype name="text_lc" class="solr.TextField" tokenized="false">
: <analyzer type="index">
: <tokenizer class="my.custom.TokenizerFactory"/>
: <filter class="my.custom.FilterFactory" words="stopwords.txt"/>
: <filter class="solr.LowerCaseFilterFactory"/>
: <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
: </analyzer>
: </fieldtype>
...
: only do indexing on the master server. However, with this schema in place
: on the slaves, as well as our custom.jar in the solrHome/lib directory, we
: run into these issues where the memory usage grows and grows without
: explanation.
...even if you only do indexing on the master, having a single analyzer
defined for a field means it's used at both index and query time (even
though you say 'type="index"'), so a memory leak in either of your custom
factories could cause a problem on a query box.
This however concerns me...
: fact, in a previous try, we had simply dropped one of our custom plugin jars
: into the lib directory but forgot to deploy the new solrconfig or schema
: files that referenced the classes in there, and the issue still occurred.
...this I can't think of a rational explanation for. Can you elaborate on
what you can do to create this problem, i.e.: does the memory usage grow
even when Solr doesn't get any requests? Or does it happen when searches
are executed? Or when commits happen? etc...
If the problem is as easy to reproduce as you describe, can you please
generate some heap dumps against a server that isn't processing any
queries -- one from when the server first starts up, and one from when the
server crashes from an OOM (there's a JVM option for generating heap dumps
on OOM that I can't think of off the top of my head)
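
For reference, on HotSpot JVMs the heap-dump-on-OOM option is
-XX:+HeapDumpOnOutOfMemoryError, optionally paired with -XX:HeapDumpPath,
and jmap can take the startup dump by hand. A rough sketch follows; the
dump directory and the start.jar launch line are assumptions about your
setup, not something taken from this thread:

```shell
# Sketch only: the dump directory and the launch command are assumptions.
HEAP_DUMP_DIR="/var/tmp/solr-heapdumps"   # hypothetical path; needs room for a full heap

# Ask the JVM to write an .hprof file automatically on OutOfMemoryError.
JAVA_OPTS="-XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=${HEAP_DUMP_DIR}"
echo "$JAVA_OPTS"

# Start Solr with those options (adjust to however the slave is really launched):
#   java $JAVA_OPTS -jar start.jar
#
# Baseline dump right after startup, before any queries arrive:
#   jmap -dump:format=b,file="${HEAP_DUMP_DIR}/startup.hprof" <solr-pid>
```

Comparing the two .hprof files in a heap analyzer should show which
classes account for the growth.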
-Hoss