Hi What I do is I put the date created for when the doc was inserted or updated and then I do a search/delete query based on that
Mav On 01/05/2012 15:31, "Bai Shen" <baishen.li...@gmail.com> wrote: >I'm running Nutch, so it's updating the documents, but I'm wanting to >remove ones that are no longer available. So in that case, there's no >update possible. > >On Tue, May 1, 2012 at 8:47 AM, mav.p...@holidaylettings.co.uk < >mav.p...@holidaylettings.co.uk> wrote: > >> Not sure if there is an automatic way but we do it via a delete query >>and >> where possible we update doc under same id to avoid deletes. >> >> >> >> >> >> On 01/05/2012 13:43, "Bai Shen" <baishen.li...@gmail.com> wrote: >> >> >What is the best method to remove old documents? Things that no >>generate >> >404 errors, etc. >> > >> >Is there an automatic method or do I have to do it manually? >> > >> >THanks. >> >>