Hi, Solr is a search engine, not a crawler. You can use Apache Nutch to crawl your site and have it indexed in Solr.
Cheers, > Hi, > > I am new to Solr/Lucene, and have some problems trying to figure out the > best way to perform indexing. I think I understand the general principles, > but have some trouble translating this to my specific goal, which is the > following: > > I want to use SolR as a search engine based on general (English) keywords, > that has indexed Wikipedia for Schools > (http://www.soschildrensvillages.org.uk/charity-news/archive/2008/10/2008- > wikipedia-for-schools). > > I initially thought that it would be sufficient to add the root document > (index.html) to Solr, after which everything would be automagically > indexed, but this does not seem to work. I have also tried to use > urldatasource in data-config.xml, but there I get a bit confused by the > settings. > > Could anyone help me understand how I can achieve my goal? > > Thanks > > Kees