Just as a follow-up: it looks like stored fields are stored verbatim for every doc.
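As a rough sanity check of the figures below, the expected overhead of the stored data can be compared with the observed difference in index size. This is only a sketch: it assumes one byte per character and that "M" means 10^6 bytes, and the constant and function names are made up for illustration.

```python
# Figures taken from the measurements in this mail.
NUM_DOCS = 49_147              # number of hotel docs
DEST_BYTES_PER_DOC = 400       # ~400 chars of destination data, assumed ~1 byte each
SIZE_WITH_STORED_MB = 131      # hotel index with dest attributes indexed and stored
SIZE_INDEXED_ONLY_MB = 111     # hotel index with dest attributes indexed only


def expected_stored_overhead_mb(num_docs: int, bytes_per_doc: int) -> float:
    """Size in MB (assumed 10**6 bytes) if every doc keeps its own copy of the data."""
    return num_docs * bytes_per_doc / 1_000_000


expected_mb = expected_stored_overhead_mb(NUM_DOCS, DEST_BYTES_PER_DOC)
observed_mb = SIZE_WITH_STORED_MB - SIZE_INDEXED_ONLY_MB

print(f"expected overhead: {expected_mb:.1f} MB")
print(f"observed overhead: {observed_mb} MB")
```

If the two numbers come out close, that is consistent with each doc carrying its own full copy of the stored values, with no sharing between docs.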

hotel index and store dest attributes
  index size: 131M
  number of records: 49147

hotel index only dest attributes
  index size: 111M
  number of records: 49147

~400 chars (bytes) of destination data * 49147 (number of hotel docs) = ~19M

So basically everything is being stored: the ~20M difference between the
two indexes is about what you would expect if each doc keeps its own full
copy of the destination data.

No difference in time to index (very rough and not scientific :-) )

So it does seem an OK strategy to denormalise docs with indexed fields but
normalise with stored fields? Or have I missed some problems with this?

cheers lee c

On 16 October 2011 11:54, lee carroll <lee.a.carr...@googlemail.com> wrote:
> Hi Chris thanks for the response
>
>> It's an inverted index, so *terms* exist once (per segment) and those terms
>> "point" to the documents -- so having the same terms (in the same fields)
>> for multiple types of documents in one index is going to take up less
>> overall space than having distinct collections for each type of document.
>
> I'm not asking about the indexed terms but rather the stored values.
> By having two doc types are we gaining anything by "storing"
> attributes only for that doc type
>
> cheers lee c
>