Re: Using remote Nutch Server to crawl, then merging results into local index

2010-12-23 Thread Erick Erickson
Merging the indexes seems problematical. It's easy enough to #code#, but I'm not sure it would produce results you want. And it supposes that your schemas are identical (or at least compatible) between the crawled data and your local data, which I wonder about... Instead, I'd think about cores. Co

Re: Using remote Nutch Server to crawl, then merging results into local index

2010-12-23 Thread Dominique Bejean
Hi, In order to crawl and index your web sites, may you can have a look at www.crawl-anywhere.com. It includes a web crawler, a document processing pipeline and a solr indexer. Dominique Le 23/12/10 16:27, Dietrich a écrit : I want to use Solr to index two types of documents: - local docume