Re: Removing old documents

2012-05-02 Thread alxsss
rl to delete a document. Is it possible that the issue is because of this mismatch? Thanks. Alex. -Original Message- From: Paul Libbrecht To: solr-user Sent: Tue, May 1, 2012 11:43 pm Subject: Re: Removing old documents With which client? paul On 2 May 2012 at 01:29, alx...@aim.

Re: Removing old documents

2012-05-02 Thread Bai Shen
Somehow I missed that there was a solrclean command. Thanks. On Tue, May 1, 2012 at 10:41 AM, Markus Jelsma wrote: > Nutch 1.4 has a separate tool to remove 404 and redirects documents from > your > index based on your CrawlDB. Trunk's SolrIndexer can add and remove > documents > in one run base

Re: Removing old documents

2012-05-01 Thread Paul Libbrecht
With which client? paul On 2 May 2012 at 01:29, alx...@aim.com wrote: > all caching is disabled and I restarted jetty. The same results.

Re: Removing old documents

2012-05-01 Thread alxsss
all caching is disabled and I restarted jetty. The same results. Thanks. Alex. -Original Message- From: Lance Norskog To: solr-user Sent: Tue, May 1, 2012 2:57 pm Subject: Re: Removing old documents Maybe this is the HTTP caching feature? Solr comes with HTTP caching turned

Re: Removing old documents

2012-05-01 Thread Paul Libbrecht
>> -Original Message- >> From: Markus Jelsma >> To: solr-user >> Sent: Tue, May 1, 2012 7:41 am >> Subject: Re: Removing old documents >> >> Nutch 1.4 has a separate tool to remove

Re: Removing old documents

2012-05-01 Thread Lance Norskog
> -Original Message- > From: Markus Jelsma > To: solr-user > Sent: Tue, May 1, 2012 7:41 am > Subject: Re: Removing old documents > > Nutch 1.4 has a separate tool to remove 404 and redirects documents from your > index based on your Crawl

Re: Removing old documents

2012-05-01 Thread alxsss
. -Original Message- From: Markus Jelsma To: solr-user Sent: Tue, May 1, 2012 7:41 am Subject: Re: Removing old documents Nutch 1.4 has a separate tool to remove 404 and redirects documents from your index based on your CrawlDB. Trunk's SolrIndexer can add and remove docu

Re: Removing old documents

2012-05-01 Thread mav.p...@holidaylettings.co.uk
Hi, what I do is store the date on which the doc was inserted or updated, and then I run a search/delete query based on that. Mav On 01/05/2012 15:31, "Bai Shen" wrote: >I'm running Nutch, so it's updating the documents, but I'm wanting to >remove ones that are no longer available. So i
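The date-based approach described in this message could look roughly like the sketch below. It only builds the XML body of a Solr delete-by-query message; it does not contact a server. The field name `last_indexed` and the 30-day cutoff are assumptions made for illustration and do not come from the thread.

```python
from datetime import datetime, timedelta, timezone
from xml.sax.saxutils import escape


def build_delete_query(field: str, cutoff: datetime) -> str:
    """Build a Solr XML delete-by-query body.

    The query matches every document whose date field is at or before
    the cutoff. `field` must be a date field that is set each time a
    document is inserted or updated (hypothetical schema field).
    """
    # Solr date fields expect UTC timestamps in this format.
    stamp = cutoff.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    query = f"{field}:[* TO {stamp}]"
    # Escape XML special characters in case the field name or query has any.
    return f"<delete><query>{escape(query)}</query></delete>"


if __name__ == "__main__":
    # Assumed policy: anything not touched in the last 30 days is stale.
    cutoff = datetime(2012, 5, 1, tzinfo=timezone.utc) - timedelta(days=30)
    print(build_delete_query("last_indexed", cutoff))
```

The resulting body would be posted to the index's update handler and followed by a commit. This only removes the right documents if the crawler really does refresh the date field on every re-index, so that pages which have disappeared keep their old timestamp.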

Re: Removing old documents

2012-05-01 Thread Markus Jelsma
Nutch 1.4 has a separate tool to remove 404 and redirects documents from your index based on your CrawlDB. Trunk's SolrIndexer can add and remove documents in one run based on segment data. On Tuesday 01 May 2012 16:31:47 Bai Shen wrote: > I'm running Nutch, so it's updating the documents, but I

Re: Removing old documents

2012-05-01 Thread Bai Shen
I'm running Nutch, so it's updating the documents, but I'm wanting to remove ones that are no longer available. So in that case, there's no update possible. On Tue, May 1, 2012 at 8:47 AM, mav.p...@holidaylettings.co.uk < mav.p...@holidaylettings.co.uk> wrote: > Not sure if there is an automatic

Re: Removing old documents

2012-05-01 Thread mav.p...@holidaylettings.co.uk
Not sure if there is an automatic way, but we do it via a delete query, and where possible we update the doc under the same id to avoid deletes. On 01/05/2012 13:43, "Bai Shen" wrote: >What is the best method to remove old documents? Things that now generate >404 errors, etc. > >Is there an automatic