Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
Hi Erick, I read mergeFactor Policy for indexing. By default, mergerFactor is 10. As said in document, High value merge factor (e.g., 25): - Pro: Generally improves indexing speed - Con: Less frequent merges, resulting in a collection with more index files which may slow searc

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
On Fri, Mar 20, 2015 at 1:35 AM, Erick Erickson wrote: > That or even hard commit to 60 seconds. It's strictly a matter of how often > you want to close old segments and open new ones. > > On Thu, Mar 19, 2015 at 3:12 AM, Nitin Solanki > wrote: > > Hi Erick.. > > I read your Articl

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Erick Erickson
That or even hard commit to 60 seconds. It's strictly a matter of how often you want to close old segments and open new ones. On Thu, Mar 19, 2015 at 3:12 AM, Nitin Solanki wrote: > Hi Erick.. > I read your Article. Really nice... > Inside that you said that for bulk indexing. Set s

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
Hi Erick.. I read your Article. Really nice... Inside that you said that for bulk indexing. Set soft commit = 10 mins and hard commit = 15sec. Is it also okay for my scenario? On Thu, Mar 19, 2015 at 1:53 AM, Erick Erickson wrote: > bq: As you said, do commits after 6 seconds >

Re: Whole RAM consumed while Indexing.

2015-03-19 Thread Nitin Solanki
Hi Alxeandre, Number of segment counts are different but document counts are same. With (soft commit - 300 and hardcommit - 6000) = No. of segment - 43 AND With (soft commit - 6 and hardcommit - 6) = No. of segment - 31 I dont' have any idea related to segment count

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Erick Erickson
bq: As you said, do commits after 6 seconds No, No, No. I'm NOT saying 6 seconds! That time is in _milliseconds_ as Shawn said. So setting it to 6 is every minute. >From solrconfig.xml, conveniently located immediately above the tag: maxTime - Maximum amount of time in ms that is al

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Alexandre Rafalovitch
Probably merged somewhat differently with some terms indexes repeating between segments. Check the number of segments in data directory.And do search for *:* and make sure both do have the same document counts. Also, In all these discussions, you still haven't answered about how fast after indexin

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Nitin Solanki
When I kept my configuration to 300 for soft commit and 3000 for hard commit and indexed some amount of data, I got the data size of the whole index to be 6GB after completing the indexing. When I changed the configuration to 6 for soft commit and 6 for hard commit and indexed same data th

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Shawn Heisey
On 3/18/2015 9:44 AM, Nitin Solanki wrote: > I am just saying. I want to be sure on commits difference.. > What if I do frequent commits or not? And why I am saying that I need to > commit things so very quickly because I have to index 28GB of data which > takes 7-8 hours(frequent comm

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Nitin Solanki
Hi Erick, I am just saying. I want to be sure on commits difference.. What if I do frequent commits or not? And why I am saying that I need to commit things so very quickly because I have to index 28GB of data which takes 7-8 hours(frequent commits). As you said, do commits after 6

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Erick Erickson
Don't do it. Really, why do you want to do this? This seems like an "XY" problem, you haven't explained why you need to commit things so very quickly. I suspect you haven't tried _searching_ while committing at such a rate, and you might as well turn all your top-level caches off in solrconfig.xml

Re: Whole RAM consumed while Indexing.

2015-03-18 Thread Nitin Solanki
Hi, If I do very very fast indexing(softcommit = 300 and hardcommit = 3000) v/s slow indexing (softcommit = 6 and hardcommit = 6) as you both said. Will fast indexing fail to index some data? Any suggestion on this ? On Tue, Mar 17, 2015 at 2:29 AM, Ramkumar R. Aiyengar < andyetitmo

Re: Whole RAM consumed while Indexing.

2015-03-16 Thread Ramkumar R. Aiyengar
Yes, and doing so is painful and takes lots of people and hardware resources to get there for large amounts of data and queries :) As Erick says, work backwards from 60s and first establish how high the commit interval can be to satisfy your use case.. On 16 Mar 2015 16:04, "Erick Erickson" wrote

Re: Whole RAM consumed while Indexing.

2015-03-16 Thread Erick Erickson
First start by lengthening your soft and hard commit intervals substantially. Start with 6 and work backwards I'd say. Ramkumar has tuned the heck out of his installation to get the commit intervals to be that short ;). I'm betting that you'll see your RAM usage go way down, but that' s a gue

Re: Whole RAM consumed while Indexing.

2015-03-15 Thread Nitin Solanki
Hi Erick, You are saying correct. Something, **"overlapping searchers" warning messages** are coming in logs. **numDocs numbers** are changing when documents are adding at the time of indexing. Any help? On Sat, Mar 14, 2015 at 11:24 PM, Erick Erickson wrote: > First, the soft commit

Re: Whole RAM consumed while Indexing.

2015-03-14 Thread Erick Erickson
First, the soft commit interval is very short. Very, very, very, very short. 300ms is just short of insane unless it's a typo ;). Here's a long background: https://lucidworks.com/blog/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/ But the short form is that you're opening sear

Re: Whole RAM consumed while Indexing.

2015-03-12 Thread Nitin Solanki
Hi Alexandre, *Hard Commit* is : ${solr.autoCommit.maxTime:3000} false *Soft Commit* is : ${solr.autoSoftCommit.maxTime:300} And I am committing 2 documents each time. Is it good config for committing? Or I am good something wrong ? On Fri, Mar 13, 2015 a

Re: Whole RAM consumed while Indexing.

2015-03-12 Thread Alexandre Rafalovitch
What's your commit strategy? Explicit commits? Soft commits/hard commits (in solrconfig.xml)? Regards, Alex. Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 12 March 2015 at 23:19, Nitin Solanki wrote: > Hello, > I have written a

Whole RAM consumed while Indexing.

2015-03-12 Thread Nitin Solanki
Hello, I have written a python script to do 2 documents indexing each time on Solr. I have 28 GB RAM with 8 CPU. When I started indexing, at that time 15 GB RAM was freed. While indexing, all RAM is consumed but **not** a single document is indexed. Why so? And it through *HTTPError: