Yes, Nutch works quite well as a crawler for Solr.
- Original Message -
From: "Tony Wang"
To: solr-user@lucene.apache.org
Sent: Thursday, March 5, 2009 5:32:57 PM GMT -06:00 US/Canada Central
Subject: what crawler do you use for Solr indexing?
Hi,
I wonder if there's any open source cra
I'm wondering, is there some way ("out of the box") to tell Solr that
we're only interested in indexing certain parts of a page? For example,
let's say I have a bunch of pages in my site that contain some common
navigation elements, roughly like this:
Stuff here about parts