Re: HTTP ERROR: 404 missing core name in path after integrating nutch

2010-02-26 Thread Ian Evans
Just wanted to give an update on my efforts. I installed the Feb. 26 update this morning and was able to access /solr/admin. Copied over the Nutch schema.xml, restarted Solr, and was able to access /solr/admin. Edited solrconfig.xml to add the Nutch request handler snippet from Lucid Imagination. Resta

Generating a sitemap

2010-03-10 Thread Ian Evans
I've been testing Nutch as a crawler for Solr, and I was wondering whether anyone has already worked on a system for getting the URLs out of Solr and generating an XML sitemap for Google.
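
One possible approach, as a minimal sketch rather than a tested solution: page through the index with Solr's standard select handler, collect the stored URL of each document, and write the URLs out in the sitemaps.org format. The field name `url` (assumed to be what the Nutch schema stores), the select URL in the usage note, and the helper names `fetch_urls` and `build_sitemap` are all assumptions for illustration.

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def fetch_urls(select_url, field="url", rows=1000):
    """Yield the stored `field` value of every document, one page at a time.

    Assumes `select_url` is a Solr select handler and that `field` is a
    stored, single-valued field holding the page URL (an assumption).
    """
    start = 0
    while True:
        query = urllib.parse.urlencode(
            {"q": "*:*", "fl": field, "wt": "json", "start": start, "rows": rows}
        )
        with urllib.request.urlopen(f"{select_url}?{query}") as resp:
            docs = json.load(resp)["response"]["docs"]
        if not docs:
            return
        for doc in docs:
            if field in doc:
                yield doc[field]
        start += rows


def build_sitemap(urls):
    """Return a sitemap document (as a string) listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        # ElementTree escapes characters such as "&" in the text for us.
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body
```

Usage would look something like `open("sitemap.xml", "w", encoding="utf-8").write(build_sitemap(fetch_urls("http://localhost:8983/solr/select")))`, with the host and path adjusted to the actual install. A single sitemap file is limited to 50,000 URLs, so a larger index would need to be split across several files tied together by a sitemap index.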