If that is the case you should look @ the DataImportHandler examples
as they can already index RSS, im doing it now for ~ a dozen feeds on
an hourly basis. (This is also for any XML-based feed for XHTML, XML,
etc). I find Nutch more useful for plain vanilla HTML (something that
was built
Grant thanks for the response.
A couple of other people have recommended trying the Nutch + Solr
approach, but I am not sure what the real benefit of doing that is.
Since Nutch provides most of the same features as Solr and Solr has
some nice additional features (like spell checking, incre
On Oct 22, 2008, at 7:57 AM, John Martyniak wrote:
I am very new to Solr, but I have played with Nutch and Lucene.
Has anybody used Solr for a whole web indexing application?
Which Spider did you use?
How does it compare to Nutch?
There is a patch that combines Nutch + Solr. Nutch is used
I am very new to Solr, but I have played with Nutch and Lucene.
Has anybody used Solr for a whole web indexing application?
Which Spider did you use?
How does it compare to Nutch?
Thanks in advance for all of the info.
-John