The answer to that question, Norberto, would depend on the versions involved: the Nutch and Solr releases, and the Lucene version each is built on.

George: why not just use straight Nutch and forget about Heritrix?

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
From: Norberto Meijome <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Cc: [EMAIL PROTECTED]
Sent: Thursday, November 22, 2007 5:54:32 PM
Subject: Re: Heritrix and Solr

On Thu, 22 Nov 2007 10:41:41 -0500
George Everitt <[EMAIL PROTECTED]> wrote:

> After a lot of googling, I came across Heritrix, which seems to be the
> most robust well supported open source crawler out there.   Heritrix
> has an integration with Nutch (NutchWax), but not with Solr.   I'm
> wondering if anybody can share any experience using Heritrix with Solr.

Out on a limb here... both Nutch and Solr use Lucene for the actual
indexing / searching. Would the indexes generated with Nutch be
compatible with / readable by Solr?

_________________________
{Beto|Norberto|Numard} Meijome

"Why do you sit there looking like an envelope without any address on it?"
  Mark Twain

I speak for myself, not my employer. Contents may be hot. Slippery when
wet. Reading disclaimers makes you go blind. Writing them is worse.
You have been Warned.


