On Fri, Dec 26, 2014 at 12:26 PM, Erick Erickson <erickerick...@gmail.com> wrote: > I don't know the complete algorithm, but if the number of docs that > satisfy the fq is "small enough", > then just the internal Lucene doc IDs are stored rather than a bitset.
If smaller than maxDoc/64 ids are collected, a sorted int set is used instead of a bitset. Also, the enum method can skip caching for the "smaller" terms: facet.enum.cache.minDf=100 might be good for general purpose. Or set the value really high to not use the filter cache at all. -Yonik