Thanks Robert,

>>if not, just customize blocktree's params with a CodecFactory in solr,
>>or even pick another implementation (FixedGap, VariableGap, whatever).

Still trying to get my head around 4.0 and flexible indexing. I'll take
another look at Mike's and your presentations.

I'm trying to figure out how to get from the Lucene JavaDocs you pointed out
to how to specify things in Solr and its config files. Is there an example
CodecFactory somewhere I could look at? Also, is there an example somewhere
of how to specify a CodecFactory/Codec in Solr using the schema.xml or
solrconfig.xml? Is there some simple way to specify minBlockSize and
maxBlockSize in schema.xml?

Once I get this all working and understand it, I'll be happy to draft some
documentation.

I'm really looking forward to experimenting with 4.0!

Tom

On Fri, Sep 7, 2012 at 2:58 PM, Robert Muir <rcm...@gmail.com> wrote:

> On Fri, Sep 7, 2012 at 2:19 PM, Tom Burton-West <tburt...@umich.edu> wrote:
> > Thanks Robert,
> >
> > I'll have to spend some time understanding the default codec for Solr 4.0.
> > Did I miss something in the changes file?
>
> http://lucene.apache.org/core/4_0_0-BETA/
>
> see the file formats section, especially
>
> http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/codecs/lucene40/Lucene40PostingsFormat.html#Termdictionary
>
> (since blocktree "covers" term dictionary and terms index)
>
> > I'll be digging into the default codec docs and testing sometime in next
> > week or two (with a 2 billion term index) If I understand it well enough,
> > I'll be happy to draft some changes up for either the wiki or Solr the
> > example solrconfig.xml file.
>
> right i think we should remove these parameters.
>
> > Does this mean that the default codec will reduce memory use for the terms
> > index enough so I don't need to use either of these settings to deal with
> > my > 2 billion term indexes?
>
> probably. i dont know enough about your terms or how much RAM you have
> to say for sure.
>
> if not, just customize blocktree's params with a CodecFactory in solr,
> or even pick another implementation (FixedGap, VariableGap, whatever).
>
> the interval/divisor stuff is mostly only useful if you are not
> reindexing from scratch: e.g. if you are gonna plop your 3.x index
> into 4.x then you should set those to whatever you were using before,
> since it will be using PreflexCodec to read those.
>
> --
> lucidworks.com