Re: HTTP ERROR: 404 missing core name in path after integrating nutch
Just wanted to give an update on my efforts. I installed the Feb. 26 update this morning and was able to access /solr/admin. I copied over the Nutch schema.xml, restarted Solr, and was still able to access /solr/admin. I then edited solrconfig.xml to add the Nutch requestHandler snippet from lucidimagination. Resta
Generating a sitemap
I've been testing Nutch to crawl for Solr, and I was wondering whether anyone has already worked on a way to get the URLs out of Solr and generate an XML sitemap for Google.
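
For what it's worth, here is a minimal sketch of one way this might be done in Python. It is not an existing Nutch or Solr tool. It assumes the index stores each page address in a field named `url` (as the Nutch schema normally does), that the select handler lives at something like http://localhost:8983/solr/select, and that JSON output (`wt=json`) is enabled. The function names `fetch_urls` and `build_sitemap` are made up for illustration.

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Namespace required by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def fetch_urls(solr_select_url, field="url", rows=1000):
    """Page through a Solr select handler and yield one stored field per doc.

    Assumptions: `solr_select_url` points at the select handler and the
    documents carry a stored field called `field` (hypothetical default "url").
    """
    start = 0
    while True:
        params = urllib.parse.urlencode(
            {"q": "*:*", "fl": field, "wt": "json", "start": start, "rows": rows}
        )
        with urllib.request.urlopen(f"{solr_select_url}?{params}") as resp:
            docs = json.load(resp)["response"]["docs"]
        if not docs:
            break
        for doc in docs:
            if field in doc:
                yield doc[field]
        start += rows


def build_sitemap(urls):
    """Return a sitemap document (as a str) with one <url><loc> per address."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for address in urls:
        entry = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as & in the text for us.
        ET.SubElement(entry, "loc").text = address
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


if __name__ == "__main__":
    # Hypothetical local Solr instance; adjust host, port, and path as needed.
    xml_text = build_sitemap(fetch_urls("http://localhost:8983/solr/select"))
    with open("sitemap.xml", "w", encoding="utf-8") as out:
        out.write(xml_text)
```

Note that the sitemap protocol caps a single file at 50,000 URLs, so a larger index would need the output split across several files plus a sitemap index. This sketch does not do that.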