Somehow I missed that there was a solrclean command. Thanks.

On Tue, May 1, 2012 at 10:41 AM, Markus Jelsma <markus.jel...@openindex.io> wrote:
> Nutch 1.4 has a separate tool to remove 404 and redirect documents from
> your index based on your CrawlDB. Trunk's SolrIndexer can add and remove
> documents in one run based on segment data.
>
> On Tuesday 01 May 2012 16:31:47 Bai Shen wrote:
> > I'm running Nutch, so it's updating the documents, but I'm wanting to
> > remove ones that are no longer available. So in that case, there's no
> > update possible.
> >
> > On Tue, May 1, 2012 at 8:47 AM, mav.p...@holidaylettings.co.uk <mav.p...@holidaylettings.co.uk> wrote:
> > > Not sure if there is an automatic way but we do it via a delete query
> > > and where possible we update doc under same id to avoid deletes.
> > >
> > > On 01/05/2012 13:43, "Bai Shen" <baishen.li...@gmail.com> wrote:
> > > >What is the best method to remove old documents? Things that now
> > > >generate 404 errors, etc.
> > > >
> > > >Is there an automatic method or do I have to do it manually?
> > > >
> > > >Thanks.
>
> --
> Markus Jelsma - CTO - Openindex
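
For anyone going the manual delete-query route mentioned above, here is a rough Python sketch of what that could look like. It only builds the standard Solr XML update messages (`<delete><id>...</id></delete>` and `<delete><query>...</query></delete>`) and posts them to an update handler. The function names, the example URL, and the idea that the document id is the page URL are my assumptions for illustration, not something taken from this thread, so adjust to your own schema.

```python
import urllib.request
from xml.sax.saxutils import escape


def build_delete_by_ids(doc_ids):
    """Build a Solr XML update message that deletes documents by unique id.

    Assumption: ids are plain strings (for a Nutch-fed index, possibly URLs).
    """
    parts = ["<delete>"]
    for doc_id in doc_ids:
        # Escape &, < and > so URLs with query strings stay valid XML.
        parts.append("<id>%s</id>" % escape(doc_id))
    parts.append("</delete>")
    return "".join(parts)


def build_delete_by_query(query):
    """Build a Solr XML update message that deletes everything matching a query."""
    return "<delete><query>%s</query></delete>" % escape(query)


def post_update(solr_update_url, xml_body):
    """POST an XML update message to Solr and return the HTTP status code.

    solr_update_url is hypothetical, e.g. "http://localhost:8983/solr/update".
    """
    request = urllib.request.Request(
        solr_update_url,
        data=xml_body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status


if __name__ == "__main__":
    # Only prints the payloads; nothing is sent unless post_update is called.
    print(build_delete_by_ids(["http://example.com/gone.html"]))
    print(build_delete_by_query("id:foo"))
```

Deletes only become visible after a commit, so a `<commit/>` message would need to be posted the same way afterwards.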