: 1)then i can think of that "maxDocs- numDocs " should be the maximum(upper
: bound) duplicate post count so far,if i assume no other deletion happened
: other than duplication deletion.
not neccessarily -- when Lucene merges segments (which can happen on any
add) deletes get flushed from the se
On Fri, Jan 23, 2009 at 10:54 PM, Otis Gospodnetic <
otis_gospodne...@yahoo.com> wrote:
> Hello,
>
> Those two numbers won't necessarily give you the number of duplicates, as
> they reflect the number of deletes in the index, and those deletes were not
> necessarily caused by Solr detecting a dupl
Nutch
- Original Message
> From: S.Selvam Siva
> To: solr-user@lucene.apache.org
> Sent: Friday, January 23, 2009 3:33:56 AM
> Subject: stats.jsp - maxDoc and numDoc-help
>
> Hi all,
>
> i am new to solr.I have posted nearly 10 lakh xml docs for the last few
> months.
Hi all,
i am new to solr.I have posted nearly 10 lakh xml docs for the last few
months.
Now i want to find out the total number of duplicate posts untill now.
whether the stats.jsp's numDocs and maxDocs is the appropriate one to find
out the total duplicate post(maxDocs-numDocs) so far?
please