Hi Marc,

Think about how you would implement a manual table-to-table database
synchronization. Iterating over every row in the target database and
checking whether it still exists in the source database, just to remove
rows that were deleted from the source table, is probably not a good idea.
The best way to implement this is through a transaction log (which is
exactly how MySQL replication works). A similar approach is used with Solr's
DataImportHandler (http://wiki.apache.org/solr/DataImportHandler) to sync
databases to Solr: you maintain a table that tracks deletes, and that table
is then used to delete the corresponding documents from Solr.

On Fri, May 9, 2008 at 11:41 PM, Marc Bechler <[EMAIL PROTECTED]> wrote:

> Hi all,
>
> one possible use case could be to synchronize the index against a given
> database. E.g., assume that you have a filesystem that is indexed
> periodically. If files are deleted on this filesystem, they will not be
> deleted in the index. This way, you can get (e.g.) the complete content from
> your index in order to check for consistency.
>
> Btw: I also played around with the rows parameter in order to get the
> overall index; but I got exceptions ("not sufficient heap space"), when
> setting up rows above some higher thresholds.
>
> Regards,
>
>  marc
>
>
> Erik Hatcher wrote:
>
> > Or make two requests...  one with rows=0 to see how many documents match
> > without retrieving any, then another with that amount specified.
> >
> >    Erik
> >
> >
> > On May 9, 2008, at 8:54 AM, Francisco Sanmartin wrote:
> >
> > > Yeah, I understand the possible problems of changing this value. It's
> > > > just a very particular case and there won't be a lot of documents to
> > > > return.
> > > I guess I'll have to use a very high int number, I just wanted to know if
> > > there was any "proper" configuration for this situation.
> > >
> > > Thanks for the answer!
> > >
> > > Pako
> > >
> > >
> > > Otis Gospodnetic wrote:
> > >
> > > > Will something a la rows=<max int here> work? ;) But are you sure
> > > > you want to do that?  It could be sloooooow.
> > > >
> > > >
> > > > Otis
> > > > --
> > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> > > >
> > > >
> > > > ----- Original Message ----
> > > >
> > > > > > From: Francisco Sanmartin <[EMAIL PROTECTED]>
> > > > > To: solr-user@lucene.apache.org
> > > > > Sent: Thursday, May 8, 2008 4:18:46 PM
> > > > > Subject: Unlimited number of return documents?
> > > > >
> > > > > > What is the value to set to "rows" in solrconfig.xml in order not
> > > > > > to have any limitation on the number of returned documents? I've
> > > > > > tried with "-1" and "0" but no luck...
> > > > >
> > > > > > <str name="q">solr</str> <str name="start">0</str> <str name="rows">10</str>
> > > > > I want solr to return all available documents by default.
> > > > >
> > > > > Thanks!
> > > > >
> > > > > Pako
> > > > >
> > > > >
> > > >
> > > >
> > > >
> > > >
> >
>


-- 
Regards,
Shalin Shekhar Mangar.
