Just as a follow up

it looks like stored fields are stored verbatim for every doc.

hotel index and store dest attributes
index size: 131M
number of records 49147

hotel index only dest attributes

index size: 111m
number of records 49147


~400 chars(bytes) of destination data * 49147 (number of hotel docs) = ~19m

basically everything is being stored

No difference in time to index (very rough and not scientific :-) )

So it does seem an ok strategy to denormalise docs with index fields
but normalise with stored fields ?
Or have i missed some problems with this ?

cheers lee c



On 16 October 2011 11:54, lee carroll <lee.a.carr...@googlemail.com> wrote:
> Hi Chris thanks for the response
>
>> It's an inverted index, so *tems* exist once (per segment) and those terms
>> "point" to the documents -- so having the same terms (in the same fields)
>> for multiple types of documents in one index is going to take up less
>> overall space then having distinct collections for each type of document.
>
> I'm not asking about the indexed terms but rather the stored values.
> By having two doc types are we gaining anything by "storing"
> attributes only for that doc type
>
> cheers lee c
>

Reply via email to