After crawling, I get the following error when indexing into Solr:
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Basic Indexing Filter (index-basic)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Basic Summarizer Plug-in (summary-basic)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Site Query Filter (query-site)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Http / Https Protocol Plug-in (protocol-httpclient)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - HTTP Framework (lib-http)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Pass-through URL Normalizer (urlnormalizer-pass)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Regex URL Filter (urlfilter-regex)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Http Protocol Plug-in (protocol-http)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - XML Response Writer Plug-in (response-xml)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Regex URL Normalizer (urlnormalizer-regex)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - OPIC Scoring Plug-in (scoring-opic)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - CyberNeko HTML Parser (lib-nekohtml)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Anchor Indexing Filter (index-anchor)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - URL Query Filter (query-url)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Regex URL Filter Framework (lib-regex-filter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - JSON Response Writer Plug-in (response-json)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Registered Extension-Points:
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Summarizer (org.apache.nutch.searcher.Summarizer)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Protocol (org.apache.nutch.protocol.Protocol)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Analysis (org.apache.nutch.analysis.NutchAnalyzer)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Field Filter (org.apache.nutch.indexer.field.FieldFilter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - HTML Parse Filter (org.apache.nutch.parse.HtmlParseFilter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Search Results Response Writer (org.apache.nutch.searcher.response.ResponseWriter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch URL Normalizer (org.apache.nutch.net.URLNormalizer)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch URL Filter (org.apache.nutch.net.URLFilter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Online Search Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Indexing Filter (org.apache.nutch.indexer.IndexingFilter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Content Parser (org.apache.nutch.parse.Parser)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Nutch Scoring (org.apache.nutch.scoring.ScoringFilter)
2011-08-24 15:47:56,225 INFO plugin.PluginRepository - Ontology Model Loader (org.apache.nutch.ontology.Ontology)
2011-08-24 15:47:56,241 INFO indexer.IndexingFilters - Adding org.apache.nutch.indexer.basic.BasicIndexingFilter
2011-08-24 15:47:56,241 INFO indexer.IndexingFilters - Adding org.apache.nutch.indexer.anchor.AnchorIndexingFilter
2011-08-24 15:47:57,366 WARN mapred.LocalJobRunner - job_local_0001
org.apache.solr.common.SolrException: Internal Server Error
Internal Server Error
request: http://localhost:7001/solr/update?wt=javabin&version=2.2
at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:343)
at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:183)
at org.apache.solr.client.solrj.request.UpdateRequest.process(UpdateRequest.java:217)
at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:48)
at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:69)
at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:447)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:170)
2011-08-24 15:47:57,882 FATAL solr.SolrIndexer - SolrIndexer: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232)
at org.apache.nutch.indexer.solr.SolrIndexer.indexSolr(SolrIndexer.java:73)