I'm running Nutch, so it is already updating the documents that still exist. What I want is to remove the ones that are no longer available, and in that case there is nothing to update.
On Tue, May 1, 2012 at 8:47 AM, mav.p...@holidaylettings.co.uk <mav.p...@holidaylettings.co.uk> wrote:

> Not sure if there is an automatic way but we do it via a delete query and
> where possible we update doc under same id to avoid deletes.
>
> On 01/05/2012 13:43, "Bai Shen" <baishen.li...@gmail.com> wrote:
>
> >What is the best method to remove old documents? Things that now generate
> >404 errors, etc.
> >
> >Is there an automatic method or do I have to do it manually?
> >
> >Thanks.
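
For reference, the delete query mentioned in the quoted reply might look roughly like the sketch below. It is not taken from anyone's setup in this thread. It assumes a Solr instance whose XML update handler is reachable at http://localhost:8983/solr/update, and that the document id is the page URL. The helper names and the list of dead URLs are hypothetical. How you find out which URLs now return 404 is left out.

```python
from urllib.request import Request, urlopen
from xml.sax.saxutils import escape

# Assumption: a single-core Solr on localhost; adjust to your own setup.
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_delete_by_query(query):
    """Build a Solr XML message that deletes every document matching query."""
    return "<delete><query>%s</query></delete>" % escape(query)


def build_delete_by_ids(doc_ids):
    """Build a Solr XML message that deletes documents by unique id."""
    ids = "".join("<id>%s</id>" % escape(doc_id) for doc_id in doc_ids)
    return "<delete>%s</delete>" % ids


def post_update(xml_body, url=SOLR_UPDATE_URL):
    """POST one XML update message to Solr and return the HTTP status code."""
    request = Request(
        url,
        data=xml_body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urlopen(request) as response:
        return response.status


def delete_dead_urls(dead_urls, url=SOLR_UPDATE_URL):
    """Delete the given ids, then commit so the removal becomes visible."""
    # Assumption: the id field holds the page URL.
    post_update(build_delete_by_ids(dead_urls), url)
    post_update("<commit/>", url)
```

Calling delete_dead_urls(["http://example.com/gone.html"]) would send the delete and then a commit. The URL there is only a placeholder.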