Re: solr-duplicate post management

2009-01-26 Thread S.Selvam Siva
On Tue, Jan 27, 2009 at 5:03 AM, Chris Hostetter wrote: > > : Hi, i added some code to *DirectUpdateHandler2.java's doDeletions()* > (solr > : 1.2.0) ,and got the solution i wanted.(logging duplicate post entry-i.e > old > : field and new field of duplicate post) > : > : > :Document d1=sea

Re: solr-duplicate post management

2009-01-24 Thread S.Selvam Siva
On Thu, Jan 22, 2009 at 2:33 PM, S.Selvam Siva wrote: > > > On Thu, Jan 22, 2009 at 7:12 AM, Chris Hostetter > wrote: > >> >> : what i need is ,to log the existing urlid and new urlid(of course both >> will >> : not be same) ,when a .xml file of same id(un

Re: stats.jsp - maxDoc and numDoc-help

2009-01-23 Thread S.Selvam Siva
On Fri, Jan 23, 2009 at 10:54 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > Hello, > > Those two numbers won't necessarily give you the number of duplicates, as > they reflect the number of deletes in the index, and those deletes were not > necessarily caused by Solr detecting a dupl

stats.jsp - maxDoc and numDoc-help

2009-01-23 Thread S.Selvam Siva
Hi all, i am new to solr.I have posted nearly 10 lakh xml docs for the last few months. Now i want to find out the total number of duplicate posts untill now. whether the stats.jsp's numDocs and maxDocs is the appropriate one to find out the total duplicate post(maxDocs-numDocs) so far? please

Re: solr-duplicate post management

2009-01-22 Thread S.Selvam Siva
On Thu, Jan 22, 2009 at 7:12 AM, Chris Hostetter wrote: > > : what i need is ,to log the existing urlid and new urlid(of course both > will > : not be same) ,when a .xml file of same id(unique field) is posted. > : > : I want to make this by modifying the solr source.Which file do i need to > : mo

solr-duplicate post management

2009-01-11 Thread S.Selvam Siva
Hi, I have 6 fields in my solr-schema. 1)id(unique key) 2)urlid 3)url and so on to 6) We have been posting 3 to 4 lakh .xml files per day which includes 50% duplicate posts. what i need is ,to log the existing urlid and new urlid(of course both will not be same) ,when a .xml file o