Thank you Otis, I will for sure check on this wa salaam, Muhammed Sameer
--- On Tue, 5/26/09, Otis Gospodnetic <otis_gospodne...@yahoo.com> wrote: > From: Otis Gospodnetic <otis_gospodne...@yahoo.com> > Subject: Re: Index size concerns > To: solr-user@lucene.apache.org > Date: Tuesday, May 26, 2009, 1:01 PM > > Muhammed, > > It sounds like you are talking about the ratio of original > data size vs. index size. The exact ratio depends on > things such as: > - whether you store fields or just index them > - whether you compress fields if you store them > - whether you have term vectors enabled or not > - analyzers and what they do - they could stem tokens, > remove them, etc., but they could also insert synonyms, and > so on > - nature of the input text - term distribution/variance > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > ----- Original Message ---- > > From: Muhammed Sameer <samix_...@yahoo.com> > > To: solr-user@lucene.apache.org > > Sent: Monday, May 25, 2009 1:22:15 PM > > Subject: Re: Index size concerns > > > > > > Salaam, > > > > Sorry for this here is the big picture > > > > Actually we use solr to index all the mails that come > to us so that we can allow > > for faster look ups. > > > > We have seen that after our mail server accepts say a > GB of mails the index size > > goes upto 800MB > > > > I hope that this time I am clear in conveying the > problem > > > > What I wanted to know is that is this index size > normal ? > > > > Regards, > > Muhammed Sameer > > > > --- On Mon, 5/25/09, Shalin Shekhar Mangar wrote: > > > > > From: Shalin Shekhar Mangar > > > Subject: Re: Index size concerns > > > To: solr-user@lucene.apache.org > > > Date: Monday, May 25, 2009, 11:19 AM > > > On Mon, May 25, 2009 at 3:53 PM, > > > Muhammed Sameer wrote: > > > > > > > > > > > We are using apache-solr to index our files > for faster > > > searches, all things > > > > happen without a problem, my only concern is > the size > > > of the cache. > > > > > > > > It seems that the trend is that the if I > cache 1 GB of > > > files the index goes > > > > to 800MB ie we are seeing a 80% cache size. > > > > > > > > Is this normal or am I missing something in > the > > > configuration of solr > > > > > > > > > > I'm sorry I do not understand your question. > Which files > > > are you talking > > > about? The Solr cache has got nothing to do with > files. It > > > caches the > > > query/filter results and solr documents. > > > > > > -- > > > Regards, > > > Shalin Shekhar Mangar. > > > > >