Victor,

Yes, if you use the same ID (and a URL could serve as a Document ID), Solr will 
replace the existing Document with the new one.
Note that Solr doesn't do crawling or web page fetching itself, but Nutch and Droids do.
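
To illustrate the point about re-using the same ID, here is a minimal sketch in Python that builds a Solr XML <add> message with the page URL as the document ID. It assumes the schema's uniqueKey field is named "id" and that "title" and "text" fields exist; those field names, the helper function names, and the update URL http://localhost:8983/solr/update are assumptions for illustration, not something taken from Victor's setup. Posting a second message with the same "id" should overwrite the earlier document rather than add a duplicate.

```python
import urllib.request
import xml.etree.ElementTree as ET


def build_add_message(url, title, body):
    """Build a Solr XML <add> message that uses the page URL as the document ID.

    The field names "id", "title" and "text" are assumptions about the schema.
    """
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in (("id", url), ("title", title), ("text", body)):
        field = ET.SubElement(doc, "field", name=name)
        field.text = value  # ElementTree escapes &, < and > on serialization
    return ET.tostring(add, encoding="unicode")


def post_to_solr(xml_message, update_url="http://localhost:8983/solr/update"):
    """POST an XML message to Solr's update handler (the URL is an assumed default)."""
    request = urllib.request.Request(
        update_url,
        data=xml_message.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status


# Two crawls of the same page produce messages with the same "id",
# so the second one would replace the first in the index.
first_crawl = build_add_message("http://example.com/page?a=1&b=2", "Old title", "old text")
second_crawl = build_add_message("http://example.com/page?a=1&b=2", "New title", "new text")
```

To apply the changes, each message would be sent with post_to_solr(...), followed by post_to_solr("<commit/>") so the update becomes visible to searches.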
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: "Huang, Zijian(Victor)" <zijian.hu...@etrade.com>
> To: solr-user@lucene.apache.org
> Sent: Wednesday, March 18, 2009 4:44:30 PM
> Subject: RE: Question about incremental index update
> 
> Hi, Otis:
>    So does Solr already have some kind of built-in library that can
> automatically detect the differences between two sets of crawled
> documents and update the index to the newer one?
>     I mean the document ID in the Solr XML doc format. The Solr wiki
> says that I can update a particular doc by its ID if I assigned
> one previously. I am wondering whether using the URL as the doc ID would
> be a good thing to do.
> 
> Thanks
> 
> Vic
> 
> -----Original Message-----
> From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] 
> Sent: Wednesday, March 18, 2009 12:43 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Question about incremental index update
> 
> 
> Victor,
> 
> Daily updates (or hourly or more frequent) are not going to be a
> problem.  I don't follow your question about document ID and using URL.
> 
> 
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> 
> 
> 
> ----- Original Message ----
> > From: "Huang, Zijian(Victor)" 
> > To: solr-user@lucene.apache.org
> > Sent: Wednesday, March 18, 2009 2:51:59 PM
> > Subject: Question about incremental index update
> > 
> > Hi:
> >    Is it easy to do a daily incremental index update in Solr, assuming 
> > the index is around 1G? In terms of giving a document an ID to 
> > facilitate index updates, is using the URL a good way to do so?
> > 
> > Thanks
> > 
> > 
> > Victor
