On 12/1/2016 8:16 AM, Dorian Hoxha wrote:
> @Shawn
> Any idea why the cache doesn't use roaring bitsets?

I had to look that up to even know what it was. Apparently Lucene does
have an implementation of that, a class called RoaringDocIdSet. It was
incorporated into the source code in October 2014 with this issue:

https://issues.apache.org/jira/browse/LUCENE-5983

As for the reason that it wasn't used for the filterCache, I think
that's because the filterCache existed LONG before that bitset
implementation was available, and when things work well (which describes
the filterCache), devs try not to mess with them too much.

I have mentioned the idea on a recently-filed issue regarding bitset
memory efficiency:

https://issues.apache.org/jira/browse/SOLR-9764

Thanks,
Shawn