I'm just getting my feet wet with Solr. I am having trouble posting a web crawl. I get the following:
C:\Users\bruce.campbell\Downloads\solr-8.6.3\solr-8.6.3\example\exampledocs> java -Ddata=web -Dc=solr -jar post.jar http://www.lucene.apache.org/
SimplePostTool version 5.0.0
Posting web pages to Solr url http://localhost:8983/solr/solr/update/extract
Entering auto mode. Indexing pages with content-types corresponding to file endings xml,json,jsonl,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log
Entering crawl at level 0 (1 links total, 1 new)
SimplePostTool: WARNING: IOException when reading page from url http://www.lucene.apache.org: www.lucene.apache.org
SimplePostTool: WARNING: The URL http://www.lucene.apache.org returned a HTTP result status of 404
0 web pages indexed.
COMMITting Solr index changes to http://localhost:8983/solr/solr/update/extract...
SimplePostTool: WARNING: Solr returned an error #404 (Not Found) for url: http://localhost:8983/solr/solr/update/extract?commit=true
SimplePostTool: WARNING: Response: <html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 404 Not Found</title>
</head>
<body><h2>HTTP ERROR 404 Not Found</h2>
<table>
<tr><th>URI:</th><td>/solr/solr/update/extract</td></tr>
<tr><th>STATUS:</th><td>404</td></tr>
<tr><th>MESSAGE:</th><td>Not Found</td></tr>
<tr><th>SERVLET:</th><td>default</td></tr>
</table>
</body>
</html>

Thank you in advance.