On 8/17/07, Kevin Lewandowski <[EMAIL PROTECTED]> wrote:
> Are there any tips on reducing the index size or what factors most
> impact index size?
>
> My index has 2.7 million documents and is 200 gigabytes and growing.
> Most documents are around 2-3kb and there are about 30 indexed fields.

Wow, that's pretty big for the document count!
- make sure that you only store fields you need to retrieve... if you
only need to search on the fields, make them indexed-only.
- unique terms take up more space... if you have date or time fields,
try reducing the time resolution
- if any stored fields are very large, perhaps try compression
- application specific compression...  for example, if you have a lot
of URL values starting with the same thing change "http://"; to a
single unique character.

-Yonik

Reply via email to