: Basically we can search hotels using city attributes but to display : city data for a chosen hotel we would search for that city document to : retrieve values. : : Do we gain anything here ? Basically would the city fields associated : with hotels be stored and repeated 74500 less times or are the values : stored once and pointers for : each hotel document kept to point at the city values ?
It's an inverted index, so *tems* exist once (per segment) and those terms "point" to the documents -- so having the same terms (in the same fields) for multiple types of documents in one index is going to take up less overall space then having distinct collections for each type of document. if you use *completely* different fields for each type of document (or use the same fields, but the documents have completley differnet terms in those fields) then you're better off with differnet collections. -Hoss