Peter, I had been playing with Nutch for quite some time before I came to Solr, so I know Nutch better than Solr. Nutch has a plugin mechanism that lets you add a parser for a document type. It comes with parser plugins for most popular doc types (with varying degrees of international text support).
My question was really: can Solr be a replacement for Nutch? And I was assuming that there has to be a Solr client that crawls, parses, converts to XML, and feeds the result to the Solr indexer via HTTP. -kuro
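
P.S. To make concrete what I mean by such a client, here is a rough Python sketch of only the last two steps (convert to XML, feed to Solr via HTTP). It is not an existing Solr client. The URL, the field names (id, title, text), and the function names are all assumptions for illustration and would have to match the actual install and schema. Crawling and parsing are left out entirely.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr

# Assumed update endpoint; adjust host, port, and path for your install.
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_add_xml(docs):
    """Turn a list of {field name: value} dicts into a Solr <add> message."""
    parts = ["<add>"]
    for doc in docs:
        parts.append("<doc>")
        for name, value in doc.items():
            # quoteattr() adds the surrounding quotes; escape() handles &, <, >.
            parts.append(
                "<field name=%s>%s</field>" % (quoteattr(name), escape(str(value)))
            )
        parts.append("</doc>")
    parts.append("</add>")
    return "".join(parts)


def post_xml(xml, url=SOLR_UPDATE_URL):
    """POST an XML update message to Solr and return the HTTP status code."""
    request = urllib.request.Request(
        url,
        data=xml.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status


if __name__ == "__main__":
    # Hypothetical output of the crawl and parse steps (not shown here).
    parsed_docs = [
        {"id": "http://example.com/a.html", "title": "Page A", "text": "Body text"},
    ]
    post_xml(build_add_xml(parsed_docs))
    post_xml("<commit/>")  # make the added documents visible to searches
```

The crawler and the per-format parsers (the part the Nutch plugins cover) would sit in front of build_add_xml, and that part is what my question is really about.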