On Fri, Sep 7, 2012 at 2:19 PM, Tom Burton-West <tburt...@umich.edu> wrote: > Thanks Robert, > > I'll have to spend some time understanding the default codec for Solr 4.0. > Did I miss something in the changes file?
http://lucene.apache.org/core/4_0_0-BETA/ see the file formats section, especially http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/codecs/lucene40/Lucene40PostingsFormat.html#Termdictionary (since blocktree "covers" term dictionary and terms index) > > I'll be digging into the default codec docs and testing sometime in next > week or two (with a 2 billion term index) If I understand it well enough, > I'll be happy to draft some changes up for either the wiki or Solr the > example solrconfig.xml file. right i think we should remove these parameters. > > Does this mean that the default codec will reduce memory use for the terms > index enough so I don't need to use either of these settings to deal with > my > 2 billion term indexes? probably. i dont know enough about your terms or how much RAM you have to say for sure. if not, just customize blocktree's params with a CodecFactory in solr, or even pick another implementation (FixedGap, VariableGap, whatever). the interval/divisor stuff is mostly only useful if you are not reindexing from scratch: e.g. if you are gonna plop your 3.x index into 4.x then you should set those to whatever you were using before, since it will be using PreflexCodec to read those. -- lucidworks.com