Alexey,

I've got that you need to get number of deleted docs in the index
http://localhost:8983/solr/admin/mbeans?stats=true&cat=CORE

here it goes
<int name="numDocs">27</int>
<int name="maxDoc">30</int>
<int name="deletedDocs">3</int>

if you need to get detailed segmentation, parse
<str name="reader">StandardDirectoryReader(segments_b:19:nrt _6(4.10.2):C9
_7(4.10.2):C9 _8(4.10.2):C9)</str>

note C# is number of docs in the segments, number of deletes are also
exposed at that toString().

Did I get what you need right?

PS: note the recent Mr McCandless G+, where he charted deleted docs ratio
under cinstant reindexing.





On Mon, Dec 8, 2014 at 2:23 PM, Dmitry Kan <solrexp...@gmail.com> wrote:

> Hi Alexey,
>
> In GUI luke there is an option to "Just expunge deleted docs without
> re-merging". In case you want to give it a try.
>
> Dmitry
>
> On Wed, Dec 3, 2014 at 1:35 PM, Alexey Kozhemiakin <
> alexey_kozhemia...@epam.com> wrote:
>
> > Dear All,
> >
> > We have a high percentage of deleted docs which do not go away because
> > there are several huge ancient segments that do not merge with anything
> > else naturally. Our use case in constant reindexing of same data - ~100
> gb,
> > 12 000 000 real records, 20 000 000 total records in index, which is ~80%
> > deletes.
> >
> > We plan to deal with situation by playing with mergeFactor,
> > reclaimDeletesWeight and maxSegmentSizeMB settings to optimize for our
> > re-indexing rate and data size.
> > And in order to do it with eyes-opened we want to see a picture similar
> to
> >
> http://blog.mikemccandless.com/2011/02/visualizing-lucenes-segment-merges.html
> > with columns of segment size and %of deletes.
> > The plan is to expose SegmentInfos via /admin/luke handler and draw
> column
> > bars in Solr admin.
> >
> > Is there an easier way to achieve that? Even in raw Luke we didn't' found
> > these data.
> >
> > We'd be happy to push the changes to Solr afterwards.
> >
> >
> > Thank you,
> > Alexey Kozhemiakin
> >
> >
>
>
> --
> Dmitry Kan
> Luke Toolbox: http://github.com/DmitryKey/luke
> Blog: http://dmitrykan.blogspot.com
> Twitter: http://twitter.com/dmitrykan
> SemanticAnalyzer: www.semanticanalyzer.info
>



-- 
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
<mkhlud...@griddynamics.com>

Reply via email to