Re: Best way to check Solr index for completeness

2010-09-29 Thread Erick Erickson
Yep, I was thinking of this on a field. I was assuming that there was a PK in the database that you were mapping to the uniqueKey field, but if that's not so then it's more of a problem. But you'd have problems anyway if you *don't* have a uniqueKey when it comes time to update any records, so it

Re: Best way to check Solr index for completeness

2010-09-29 Thread Walter Underwood
Think about what fields you need to return. For this, you probably only need the id. That could be a lot faster than the default set of fields. wunder On Sep 29, 2010, at 9:04 AM, dshvadskiy wrote: > > Actually retrieving 1000 docs via search isn't that bad. Turned out it takes > under 1 sec.

Re: Best way to check Solr index for completeness

2010-09-29 Thread dshvadskiy
Actually retrieving 1000 docs via search isn't that bad. Turned out it takes under 1 sec. I still like the idea of using TermComponent and will use it in the future if number of docs in the index will grow. Thanks for all suggestions. Dmitriy -- View this message in context: http://lucene.47206

Re: Best way to check Solr index for completeness

2010-09-29 Thread dshvadskiy
Regenerating index is a slow operation due to limitation of the source systems. We run several complex SQL statements to generate 1 Solr document. Full reindex takes about 24 hours. -- View this message in context: http://lucene.472066.n3.nabble.com/Best-way-to-check-Solr-index-for-completenes

Re: Best way to check Solr index for completeness

2010-09-29 Thread dshvadskiy
Using TermComponent is an interesting suggestion. However my understanding it will work only for unique terms. For example compare database primary key with Solr id field. A variation of that is to calculate some kind of unique record hash and store it in the index.Then retrieve id and hash via T

Re: Best way to check Solr index for completeness

2010-09-29 Thread Peter Karich
How long does it take to get 1000 docs? Why not ensure this while indexing? I think besides your suggestion or the suggestion of Luke there is no other way... Regards, Peter. > Hello, > What would be the best way to check Solr index against original system > (Database) to make sure index is up t

Re: Best way to check Solr index for completeness

2010-09-28 Thread Dennis Gearon
owded' Laugh at http://www.yert.com/film.php --- On Tue, 9/28/10, dshvadskiy wrote: > From: dshvadskiy > Subject: Re: Best way to check Solr index for completeness > To: solr-user@lucene.apache.org > Date: Tuesday, September 28, 2010, 2:11 PM > > That will certainly work for

Re: Best way to check Solr index for completeness

2010-09-28 Thread Erick Erickson
Have you looked at SOLRs TermComponent? Assuming you have a unique key, I think you could use TermsComponent to walk that field for comparing against your database rather then getting all the documents. HTH Erick On Tue, Sep 28, 2010 at 5:11 PM, dshvadskiy wrote: > > That will certainly work fo

Re: Best way to check Solr index for completeness

2010-09-28 Thread dshvadskiy
That will certainly work for most recent updates but I need to compare entire index. Dmitriy Luke Crouch wrote: > > Is there a 1:1 ratio of db records to solr documents? If so, couldn't you > simply select the most recent updated record from the db and check to make > sure the corresponding sol

Re: Best way to check Solr index for completeness

2010-09-28 Thread Luke Crouch
Is there a 1:1 ratio of db records to solr documents? If so, couldn't you simply select the most recent updated record from the db and check to make sure the corresponding solr doc has the same timestamp? -L On Tue, Sep 28, 2010 at 3:48 PM, Dmitriy Shvadskiy wrote: > Hello, > What would be the b