Hi,

I’m trying to integrate Solr with Nutch and I performed all of the necessary 
steps except after Nutch performs the crawl it appears that I’m receiving a 
connection refused.

2014-03-24 11:42:43,062 INFO  indexer.IndexerMapReduce - IndexerMapReduce: 
crawldb: TestCrawl/crawldb
2014-03-24 11:42:43,062 INFO  indexer.IndexerMapReduce - IndexerMapReduce: 
linkdb: TestCrawl/linkdb
2014-03-24 11:42:43,062 INFO  indexer.IndexerMapReduce - IndexerMapReduces: 
adding segment: TestCrawl/segments/20140324113941
2014-03-24 11:42:43,304 WARN  util.NativeCodeLoader - Unable to load 
native-hadoop library for your platform... using builtin-java classes where 
applicable
2014-03-24 11:42:43,942 INFO  anchor.AnchorIndexingFilter - Anchor 
deduplication is: off
2014-03-24 11:42:44,456 INFO  indexer.IndexWriters - Adding 
org.apache.nutch.indexwriter.solr.SolrIndexWriter
2014-03-24 11:42:44,465 INFO  solr.SolrUtils - Authenticating as: <my username>
2014-03-24 11:42:44,483 INFO  solr.SolrMappingReader - source: content dest: 
content
2014-03-24 11:42:44,483 INFO  solr.SolrMappingReader - source: title dest: title
2014-03-24 11:42:44,483 INFO  solr.SolrMappingReader - source: host dest: host
2014-03-24 11:42:44,483 INFO  solr.SolrMappingReader - source: segment dest: 
segment
2014-03-24 11:42:44,483 INFO  solr.SolrMappingReader - source: boost dest: boost
2014-03-24 11:42:44,484 INFO  solr.SolrMappingReader - source: digest dest: 
digest
2014-03-24 11:42:44,484 INFO  solr.SolrMappingReader - source: tstamp dest: 
tstamp
2014-03-24 11:42:44,484 INFO  solr.SolrMappingReader - source: url dest: id
2014-03-24 11:42:44,484 INFO  solr.SolrMappingReader - source: url dest: url
2014-03-24 11:42:44,616 INFO  solr.SolrIndexWriter - Indexing 22 documents
2014-03-24 11:42:44,704 INFO  httpclient.HttpMethodDirector - I/O exception 
(java.net.ConnectException) caught when processing request: Connection refused
2014-03-24 11:42:44,704 INFO  httpclient.HttpMethodDirector - Retrying request
2014-03-24 11:42:44,707 INFO  httpclient.HttpMethodDirector - I/O exception 
(java.net.ConnectException) caught when processing request: Connection refused
2014-03-24 11:42:44,707 INFO  httpclient.HttpMethodDirector - Retrying request
2014-03-24 11:42:44,707 INFO  httpclient.HttpMethodDirector - I/O exception 
(java.net.ConnectException) caught when processing request: Connection refused
2014-03-24 11:42:44,707 INFO  httpclient.HttpMethodDirector - Retrying request
2014-03-24 11:42:44,708 INFO  solr.SolrIndexWriter - Indexing 22 documents
2014-03-24 11:42:44,709 INFO  httpclient.HttpMethodDirector - I/O exception 
(java.net.ConnectException) caught when processing request: Connection refused
2014-03-24 11:42:44,709 INFO  httpclient.HttpMethodDirector - Retrying request
2014-03-24 11:42:44,709 INFO  httpclient.HttpMethodDirector - I/O exception 
(java.net.ConnectException) caught when processing request: Connection refused
2014-03-24 11:42:44,709 INFO  httpclient.HttpMethodDirector - Retrying request
2014-03-24 11:42:44,709 INFO  httpclient.HttpMethodDirector - I/O exception 
(java.net.ConnectException) caught when processing request: Connection refused
2014-03-24 11:42:44,709 INFO  httpclient.HttpMethodDirector - Retrying request
2014-03-24 11:42:44,715 WARN  mapred.LocalJobRunner - job_local319933392_0001
java.io.IOException
        at 
org.apache.nutch.indexwriter.solr.SolrIndexWriter.makeIOException(SolrIndexWriter.java:173)
        at 
org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:159)
        at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
        at 
org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
        at 
org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
        at 
org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
Caused by: org.apache.solr.client.solrj.SolrServerException: 
java.net.ConnectException: Connection refused
        at 
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:478)
        at 
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
        at 
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
        at 
org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
        ... 6 more
Caused by: java.net.ConnectException: Connection refused
        at java.net.PlainSocketImpl.socketConnect(Native Method)
        at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
        at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:213)
        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:200)
        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
        at java.net.Socket.connect(Socket.java:529)
        at java.net.Socket.connect(Socket.java:478)
        at java.net.Socket.<init>(Socket.java:375)
        at java.net.Socket.<init>(Socket.java:249)
        at 
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
        at 
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
        at 
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
        at 
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
        at 
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
        at 
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
        at 
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
        at 
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
        ... 9 more
2014-03-24 11:42:45,705 ERROR indexer.IndexingJob - Indexer: 
java.io.IOException: Job failed!


My solr instance is installed on Tomcat and it’s protected using 
tomcat-users.xml. I read that I should change the nutch-default.xml file with 
the authentication properties:

solr.server.url
solr.auth
solr.auth.username
solr.auth.password

It appears that my username is being attempted however it still seems that the 
connection is refused. 

Any ideas?

Thanks in advance,
Laura

Reply via email to