Thank you for the feedback, Otis. Yes, I thought that such an approach is usefull if the number of pages to crawl is relatively low.
However, what about using solr + nutch? Exists the problem that this would not scale, if the index becomes too large, up to now? What about extending nutch with features such as the DisMaxRequestHandler, is the amount of work larger than it would be in Solr? The big pro of Solr is that I can enhance the whole thing in a few minutes, if I need more extra-information to improve the search. That makes it very easy to experiment with boostings, filters etc. As far as I know, Nutch does not offer such greatefull features. Do you know a little bit more about that? Probably I should ask such question at the Nutch-mailing list, but at the moment I hope that I can achieve as much as I can with Solr, because I have no experiences with Hadoop but Nutch seems to require it. Thank you! - Mitch -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-and-Nutch-Droids-to-use-or-not-to-use-tp900069p900480.html Sent from the Solr - User mailing list archive at Nabble.com.