Re: Multiple document types

2012-01-25 Thread Simon Willnauer
o:simon.willna...@googlemail.com] > Sent: Wednesday, January 25, 2012 2:08 PM > To: java-user > Subject: Re: Multiple document types > > hey Frank, > > can you elaborate what you mean by different doc types? Are you > referring to an entity ie. a table per entity to

RE: Multiple document types

2012-01-25 Thread Frank DeRose
al Message- From: Simon Willnauer [mailto:simon.willna...@googlemail.com] Sent: Wednesday, January 25, 2012 2:08 PM To: java-user Subject: Re: Multiple document types hey Frank, can you elaborate what you mean by different doc types? Are you referring to an entity ie. a table per entity to sp

Re: multiple document types in a core

2011-10-24 Thread lee carroll
Hi Erick, Your right I think. On resources we gain a little bit on: disk (a production implementation with live data would be 500 mb saved in disk usage on each slave and master) some reduction in network traffic on replication (we do a full re-index every 24 hours at present) On design we gain a

Re: multiple document types in a core

2011-10-23 Thread Erick Erickson
Yes, stored fields are placed verbatim for every doc. But I wonder at the utility of trying to share stored information. The stored info is put in certain files in the index, see: http://lucene.apache.org/java/3_0_2/fileformats.html#file-names and the files that store data are pretty much irreleva

Re: multiple document types in a core

2011-10-17 Thread lee carroll
Just as a follow up it looks like stored fields are stored verbatim for every doc. hotel index and store dest attributes index size: 131M number of records 49147 hotel index only dest attributes index size: 111m number of records 49147 ~400 chars(bytes) of destination data * 49147 (number of

Re: multiple document types in a core

2011-10-16 Thread lee carroll
Hi Chris thanks for the response > It's an inverted index, so *tems* exist once (per segment) and those terms > "point" to the documents -- so having the same terms (in the same fields) > for multiple types of documents in one index is going to take up less > overall space then having distinct col

Re: multiple document types in a core

2011-10-15 Thread Chris Hostetter
: Basically we can search hotels using city attributes but to display : city data for a chosen hotel we would search for that city document to : retrieve values. : : Do we gain anything here ? Basically would the city fields associated : with hotels be stored and repeated 74500 less times or are