Fuad Efendi wrote:
> Simple field (10 different values: Canada, USA, UK, ...), 64-bit JVM... no
> difference between maxdoc and maxdoc + 1 for such estimate... difference is
> between 0.4Gb and 1.2Gb...
>

I'm not sure I understand - but I didn't mean to imply the +1 on maxdoc meant anything. The issue is that in the end, it only needs a String array of size String[UniqueTerms] - but because it can't easily figure out that number up front, it first creates an array of String[MaxDoc+1] - so with a ton of docs and only a few unique terms, you get a temporary spike in the RAM requirements until it sizes the array down. That is one pointer for each doc.
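
To put rough numbers on that, here is a minimal Java sketch of the arithmetic only, not the actual Lucene code. The class and method names are made up for illustration, the 100 million doc count is a hypothetical figure, and 8 bytes per reference assumes a 64-bit JVM without compressed pointers; array headers and the String objects themselves are ignored.

```java
// Back-of-the-envelope estimate of the pointer arrays described above.
// All names here are hypothetical; this is not Lucene's API.
public class FieldCacheEstimate {

    // Bytes taken by the temporary String[maxDoc + 1] array (references only).
    static long tempArrayBytes(long maxDoc, int bytesPerRef) {
        return (maxDoc + 1) * bytesPerRef;
    }

    // Bytes taken by the final String[uniqueTerms] array (references only).
    static long finalArrayBytes(long uniqueTerms, int bytesPerRef) {
        return uniqueTerms * bytesPerRef;
    }

    public static void main(String[] args) {
        long maxDoc = 100_000_000L; // assumed index size
        long uniqueTerms = 10L;     // e.g. Canada, USA, UK, ...
        int bytesPerRef = 8;        // assumed: 64-bit JVM, uncompressed references

        System.out.println("temp array bytes:  " + tempArrayBytes(maxDoc, bytesPerRef));
        System.out.println("final array bytes: " + finalArrayBytes(uniqueTerms, bytesPerRef));
    }
}
```

The +1 changes the result by a single pointer, which is why it does not matter for the estimate; the doc count is what drives the temporary array size.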
--
- Mark

http://www.lucidimagination.com