RE: maxDoc ten times greater than numDoc

2017-04-13 Thread Markus Jelsma
sense. Thanks, Markus -Original message- > From:Erick Erickson > Sent: Thursday 13th April 2017 17:07 > To: solr-user > Subject: Re: maxDoc ten times greater than numDoc > > If you want to be brave > > Through a clever bit of reflection, the parameters

Re: maxDoc ten times greater than numDoc

2017-04-13 Thread Erick Erickson
dexing as opposed to deleting only items older >> than the set I am already about to reindex. This strategy works fine with >> other reindexing programs, they don't suffer this problem. >> >> So, it is not solved, but not a problem anymore. Thanks all anyway :)

Re: maxDoc ten times greater than numDoc

2017-04-13 Thread Alexandre Rafalovitch
s strategy works fine with > other reindexing programs, they don't suffer this problem. > > So, it is not solved, but not a problem anymore. Thanks all anyway :) > Markus > > -Original message- >> From:Erick Erickson >> Sent: Wednesday 12th April 2017 17:5

RE: maxDoc ten times greater than numDoc

2017-04-13 Thread Markus Jelsma
To: solr-user > Subject: Re: maxDoc ten times greater than numDoc > > Yes, this is very strange. My bet: you have something > custom, a setting, indexing code, whatever that > is getting in the way. > > Second possibility (really stretching here): your > merge settings

Re: maxDoc ten times greater than numDoc

2017-04-12 Thread Erick Erickson
Yes, this is very strange. My bet: you have something custom, a setting, indexing code, whatever that is getting in the way. Second possibility (really stretching here): your merge settings are set to 10 segments having to exist before merging and somehow not all the docs in the segments are repla

Re: maxDoc ten times greater than numDoc

2017-04-12 Thread Shawn Heisey
On 4/12/2017 5:11 AM, Markus Jelsma wrote: > One of our 2 shard collections is rather small and gets all its entries > reindexed every 20 minutes or so. Now I just noticed maxDoc is ten times > greater than numDoc, the merger is never scheduled but settings are default. > We just overwrite the ex

RE: maxDoc ten times greater than numDoc

2017-04-12 Thread alessandro.benedetti
This may be incorrect, but I think that even if a merge happened and the disk space is actually released, the deleted docs count will still be there. What about your index size? Is the index 10 times bigger than expected? Cheers - --- Alessandro Benedetti Search Consultant, R

RE: maxDoc ten times greater than numDoc

2017-04-12 Thread Markus Jelsma
To: solr-user@lucene.apache.org > Subject: Re: maxDoc ten times greater than numDoc > > Hi Markus, > maxDocs includes deletions : > > Deleted Docs: 74026 + Num Docs: 8336 = Max Doc:82362 > > Cheers > > > > - > --- > Alessandro Benede

Re: maxDoc ten times greater than numDoc

2017-04-12 Thread alessandro.benedetti
Hi Markus, maxDoc includes deletions: Deleted Docs: 74026 + Num Docs: 8336 = Max Doc: 82362 Cheers - --- Alessandro Benedetti Search Consultant, R&D Software Engineer, Director Sease Ltd. - www.sease.io
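
The arithmetic in Alessandro's message can be checked with a few lines of Python. This is only a sketch: the three counts are the ones quoted in the thread, while the helper names `deleted_docs` and `deleted_ratio` are made up for illustration and are not part of Solr or Lucene.

```python
def deleted_docs(max_doc: int, num_docs: int) -> int:
    """Deleted documents still held in segments, implied by the two counters."""
    if num_docs > max_doc:
        raise ValueError("numDocs cannot exceed maxDoc")
    return max_doc - num_docs


def deleted_ratio(max_doc: int, num_docs: int) -> float:
    """Fraction of maxDoc taken up by deleted documents (0.0 for an empty index)."""
    if max_doc == 0:
        return 0.0
    return deleted_docs(max_doc, num_docs) / max_doc


# Counts quoted in the thread.
max_doc = 82362
num_docs = 8336

print("deleted docs:", deleted_docs(max_doc, num_docs))
print("deleted share of maxDoc:", round(deleted_ratio(max_doc, num_docs), 3))
print("maxDoc / numDocs:", round(max_doc / num_docs, 1))
```

With these numbers the deleted count matches the 74026 reported above, and maxDoc comes out at roughly ten times numDocs, which is the ratio named in the subject line.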