Thanks Erick for your advance and share. Regards,
On Mon, Dec 2, 2013 at 11:06 PM, Erick Erickson <erickerick...@gmail.com>wrote: > TieredMergePolicy is the default even though it's > commented out in solrconfig, it's still being used. > So there's nothing to do. > > Given the size of your index, you can actually do > whatever you please. Optimizing it will shrink its size, > but frankly your index is so small I doubt you'll see any > noticeable difference. They'll self-purge as you re-crawl > eventually. > > In all, I think you can mostly ignore the issue. > > Best, > Erick > > > On Sun, Dec 1, 2013 at 8:00 PM, Bayu Widyasanyata > <bwidyasany...@gmail.com>wrote: > > > Hi Erick, > > > > After waiting for some days abt. a week (I did daily crawling & > indexing), > > here are the docs summary: > > > > Num Docs: 9738 > > Max Doc: 15311 > > Deleted Docs: 5573 > > Version: 781 > > Segment Count: 5 > > > > The percentage of deletedDocs of NumDocs is near 57%. > > > > In the other, the TieredMergePolicy in solrconfig.xml is still disabled. > > > > <!-- > > <mergePolicy class="org.apache.lucene.index.TieredMergePolicy"> > > <int name="maxMergeAtOnce">10</int> > > <int name="segmentsPerTier">10</int> > > </mergePolicy> > > --> > > > > Should we enable it and wait for the effect? > > > > Thanks! > > > > > > > > On Wed, Nov 20, 2013 at 9:55 PM, Bayu Widyasanyata > > <bwidyasany...@gmail.com>wrote: > > > > > Thanks Erick. > > > I will check that on next round. > > > > > > --- > > > wassalam, > > > [bayu] > > > > > > /sent from Android phone/ > > > On Nov 20, 2013 7:45 PM, "Erick Erickson" <erickerick...@gmail.com> > > wrote: > > > > > >> You probably shouldn't optimize at all. The default TieredMergePolicy > > >> will eventually purge the deleted files' data, which is really what > > >> optimize > > >> does. So despite its name, most of the time it's not really worth the > > >> effort. > > >> > > >> Take a look at your Solr admin page, the "overview" link for a core. > > >> If the number of deleted docs is a significant percentage of your > > >> numDocs (I typically use 20% or so, but YMMV) then optimize > > >> might be worthwhile. Otherwise, it's a distraction unless and until > > >> you have some evidence that it actually makes a difference. > > >> > > >> Best, > > >> Erick > > >> > > >> > > >> On Wed, Nov 20, 2013 at 7:33 AM, Bayu Widyasanyata > > >> <bwidyasany...@gmail.com>wrote: > > >> > > >> > Hi, > > >> > > > >> > After successfully configured re-crawling script, I sometimes > checked > > >> and > > >> > found on Solr Admin that "Optimized" status of my collection is not > > >> > optimized (slash icon). > > >> > > > >> > Hence I did optimized steps manually. > > >> > > > >> > How to make my crawling optimized automatically? > > >> > > > >> > Should we restart Solr (I use Tomcat) as shown on here [1] > > >> > > > >> > [1] http://wiki.apache.org/nutch/Crawl > > >> > > > >> > Thanks! > > >> > > > >> > -- > > >> > wassalam, > > >> > [bayu] > > >> > > > >> > > > > > > > > > -- > > wassalam, > > [bayu] > > > -- wassalam, [bayu]