On 8/17/07, Kevin Lewandowski <[EMAIL PROTECTED]> wrote: > Are there any tips on reducing the index size or what factors most > impact index size? > > My index has 2.7 million documents and is 200 gigabytes and growing. > Most documents are around 2-3kb and there are about 30 indexed fields.
Wow, that's pretty big for the document count! - make sure that you only store fields you need to retrieve... if you only need to search on the fields, make them indexed-only. - unique terms take up more space... if you have date or time fields, try reducing the time resolution - if any stored fields are very large, perhaps try compression - application specific compression... for example, if you have a lot of URL values starting with the same thing change "http://" to a single unique character. -Yonik