Victor, Yes, if you use the same ID (and a URL could serve as a Document ID), Solr will update the Document. Note that Solr doesn't do crawling/web page fetching, but Nutch and Droids do.
Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: "Huang, Zijian(Victor)" <zijian.hu...@etrade.com> > To: solr-user@lucene.apache.org > Sent: Wednesday, March 18, 2009 4:44:30 PM > Subject: RE: Question about incremental index update > > Hi, Otis: > so does Solr already has some kind of libraries build-in, which it > can automatically detect the different within two set of crawled > documents and update the index to the newer one? > I mean the document ID in Slor xml doc format. Inside the Solr wiki, > it tells me that I can update a particular doc by its ID if I assigned > one previously. I am thinking if using the url as the doc ID will be a > good thing to do. > > Thanks > > Vic > > -----Original Message----- > From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] > Sent: Wednesday, March 18, 2009 12:43 PM > To: solr-user@lucene.apache.org > Subject: Re: Question about incremental index update > > > Victor, > > Daily updates (or hourly or more frequent) are not going to be a > problem. I don't follow your question about document ID and using URL. > > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > ----- Original Message ---- > > From: "Huang, Zijian(Victor)" > > To: solr-user@lucene.apache.org > > Sent: Wednesday, March 18, 2009 2:51:59 PM > > Subject: Question about incremental index update > > > > Hi: > > Is it easy to do daily incremental index update in Solr assuming > > the index is around 1G? In terms of giving a document an ID to > > facilitate index update, is it using the URL a good way to do so? > > > > Thanks > > > > > > Victor