: <fieldtype name="text_lc" class="solr.TextField" tokenized="false">
:   <analyzer type="index">
:     <tokenizer class="my.custom.TokenizerFactory"/>
:     <filter class="my.custom.FilterFactory" words="stopwords.txt"/>
:     <filter class="solr.LowerCaseFilterFactory"/>
:     <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
:   </analyzer>
: </fieldtype>
        ...
: only do indexing on the master server.  However, with this schema in place
: on the slaves, as well as our custom.jar in the solrHome/lib directory, we
: run into these issues where the memory usage grows and grows without
: explanation.

...even if you only do indexing on the master, having a single analyzer
defined for a field means it's used at both index and query time (even 
though you say 'type="index"') so a memory leak in either of your custom 
factories could cause a problem on a query box.
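
One way to test that theory on a slave might be to give the field type an
explicit query analyzer built only from stock factories, so the custom
classes are never instantiated for query analysis. This is only a rough
sketch: using solr.WhitespaceTokenizerFactory as the stand-in tokenizer is
an assumption made for illustration, and it will change how queries match,
so treat it as a diagnostic experiment and not a production config.

```xml
<fieldtype name="text_lc" class="solr.TextField" tokenized="false">
  <!-- index-time chain, copied unchanged from the quoted schema -->
  <analyzer type="index">
    <tokenizer class="my.custom.TokenizerFactory"/>
    <filter class="my.custom.FilterFactory" words="stopwords.txt"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
  <!-- ASSUMPTION: explicit query-time chain using only stock Solr factories,
       so queries no longer fall back to the index analyzer above -->
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldtype>
```

If memory still grows on the slaves with this in place, the query-time use
of the custom factories is probably not the cause.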

This however concerns me...

: fact, in a previous try, we had simply dropped one of our custom plugin jars
: into the lib directory but forgot to deploy the new solrconfig or schema
: files that referenced the classes in there, and the issue still occurred.

...this I can't think of a rational explanation for.  Can you elaborate on 
what you do to trigger this problem?  I.e.: does the memory usage grow 
even when Solr doesn't get any requests?  Or does it happen when searches 
are executed?  Or when commits happen?  Etc...

If the problem is as easy to reproduce as you describe, can you please 
generate some heap dumps against a server that isn't processing any 
queries -- one from when the server first starts up, and one from when the 
server crashes from an OOM (there's a JVM option for generating heap dumps 
on OOM; on HotSpot JVMs it is -XX:+HeapDumpOnOutOfMemoryError).



-Hoss
