On 7/17/2013 8:06 AM, Furkan KAMACI wrote:
> I have crawled some web pages and indexed them in my SolrCloud (Solr 4.2.1).
> However, the index already contained documents before I indexed them. I can
> calculate the difference between the current and previous document counts,
> but that does not tell me how many documents I indexed, because website
> URLs are the unique IDs in my system. Some documents were therefore updated
> and did not increase the document count.
> 
> My question is: how can I find out how many documents were newly indexed
> and how many were updated?

Look at the update handler statistics.  Your application should record
the numbers there before indexing, then check the handler statistics
again afterward and note the differences.  Here's a URL that can give you
those statistics:

http://server:port/solr/mycollectionname/admin/mbeans?stats=true
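
As a rough illustration of the record-then-compare approach, here is a
minimal Python sketch.  Several things in it are assumptions rather than
anything stated above: the cat=UPDATEHANDLER and wt=json parameters, the
default JSON layout of the response, the stat names (cumulative_adds and
friends, as I believe Solr 4.x reports them), and all of the function
names, which are made up for this example.  The last function also
assumes nothing was deleted between the two snapshots, so that every add
either created a new document or overwrote an existing one.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Stat names assumed to match what Solr 4.x reports for the update handler.
DEFAULT_KEYS = (
    "cumulative_adds",
    "cumulative_deletesById",
    "cumulative_deletesByQuery",
    "cumulative_errors",
)


def mbeans_url(base_url, collection):
    """Build the mbeans stats URL, limited to the update handler category."""
    params = urlencode({"stats": "true", "cat": "UPDATEHANDLER", "wt": "json"})
    return f"{base_url}/{collection}/admin/mbeans?{params}"


def extract_update_stats(response):
    """Pull the update handler's stats map out of a parsed mbeans response.

    Assumes "solr-mbeans" is a flat list that alternates category names
    and category contents (the default JSON layout).
    """
    beans = response["solr-mbeans"]
    categories = dict(zip(beans[0::2], beans[1::2]))
    return categories["UPDATEHANDLER"]["updateHandler"]["stats"]


def fetch_update_stats(base_url, collection):
    """Fetch and parse the current update handler statistics."""
    with urlopen(mbeans_url(base_url, collection)) as resp:
        return extract_update_stats(json.load(resp))


def stats_delta(before, after, keys=DEFAULT_KEYS):
    """Subtract an earlier snapshot from a later one, key by key."""
    return {k: after.get(k, 0) - before.get(k, 0) for k in keys}


def estimate_new_and_updated(adds_delta, num_docs_delta):
    """Split adds into new vs. overwritten documents.

    Only valid if no deletes happened between the two snapshots.
    """
    return {"new": num_docs_delta, "updated": adds_delta - num_docs_delta}
```

You would call fetch_update_stats("http://server:port/solr",
"mycollectionname") once before the indexing run and once after the
commit, pass both results to stats_delta, and combine the
cumulative_adds difference with the document count difference you
already compute.  In SolrCloud each core reports its own statistics, so
this would likely need to be repeated for one core per shard and the
results summed.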

They are also available in the admin UI, in the UPDATEHANDLER section of
Plugins / Stats, but you can't really use that in a program.

By setting the request handler path on a query object to /admin/mbeans
and setting the stats parameter to true, you can get this information
with SolrJ.

Thanks,
Shawn
