The answer to that question, Norberto, would depend on the versions, that is, on which Lucene version each of Nutch and Solr is built on.

George: why not just use straight Nutch and forget about Heritrix?

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
From: Norberto Meijome <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Cc: [EMAIL PROTECTED]
Sent: Thursday, November 22, 2007 5:54:32 PM
Subject: Re: Heritrix and Solr

On Thu, 22 Nov 2007 10:41:41 -0500
George Everitt <[EMAIL PROTECTED]> wrote:

> After a lot of googling, I came across Heritrix, which seems to be the
> most robust well supported open source crawler out there. Heritrix
> has an integration with Nutch (NutchWax), but not with Solr. I'm
> wondering if anybody can share any experience using Heritrix with Solr.

out on a limb here... both Nutch and SOLR use Lucene for the actual indexing / searching. Would the indexes generated with Nutch be compatible / readable with SOLR?

_________________________
{Beto|Norberto|Numard} Meijome

"Why do you sit there looking like an envelope without any address on it?"
  Mark Twain

I speak for myself, not my employer. Contents may be hot. Slippery when wet. Reading disclaimers makes you go blind. Writing them is worse. You have been Warned.