If you're using Solr anyway, you'd better upgrade to Nutch 1.3 with Solr 3.x 
support.

> Works like a charm.
> 
> Thanks,
> 
> Leo
> 
> On Wed, 2011-07-13 at 11:31 +0530, Geek Gamer wrote:
> > you need to update the solrj libs to 3.x version. the java bin format
> > has changed .
> > I made the change a few months back, you can pull the changes from
> > https://github.com/geek4377/nutch/tree/geek5377-1.2.1
> > 
> > hope that helps,
> > 
> > 
> > On Wed, Jul 13, 2011 at 8:58 AM, Leo Subscriptions
> > 
> > <llsub...@zudiewiener.com> wrote:
> > > I'm running 64bit Ubuntu 11.04, nutch 1.2, solr 3.3 (downloaded, not
> > > built) and tomcat6 following this (and some other) links
> > > http://wiki.apache.org/nutch/RunningNutchAndSolr
> > > 
> > > I have added the nutch schema and can access/view this schema via the
> > > admin page. nutch also works as I can perfrom successful searches.
> > > 
> > > When I execute the following:
> > >>> ./bin/nutch solrindex http://localhost:8080/solr/core0 crawl/crawldb
> > > 
> > > crawl/linkdb crawl/segments/*
> > > 
> > > I (eventually) get an io error.
> > > 
> > > Tha above command creates the following
> > > files /var/lib/tomcat6/solr/core0/data/index/
> > > 
> > > -------------------------------
> > > 544 -rw-r--r-- 1 tomcat6 tomcat6 557056 2011-07-13 11:09 _1.fdt
> > > 
> > >  0 -rw-r--r-- 1 tomcat6 tomcat6      0 2011-07-13 11:00 _1.fdx
> > >  4 -rw-r--r-- 1 tomcat6 tomcat6     32 2011-07-13 10:59 segments_2
> > >  4 -rw-r--r-- 1 tomcat6 tomcat6     20 2011-07-13 10:59 segments.gen
> > >  0 -rw-r--r-- 1 tomcat6 tomcat6      0 2011-07-13 11:00 write.lock
> > > 
> > > -------------------------------
> > > 
> > > but the hadoop.log reports the following error
> > > 
> > > ---------------------------
> > > 2011-07-13 11:09:47,665 INFO  indexer.IndexingFilters - Adding
> > > org.apache.nutch.indexer.basic.BasicIndexingFilter
> > > 2011-07-13 11:09:47,666 INFO  indexer.IndexingFilters - Adding
> > > org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: content
> > > dest: content
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: site
> > > dest: site
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: title
> > > dest: title
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: host
> > > dest: host
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: segment
> > > dest: segment
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: boost
> > > dest: boost
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: digest
> > > dest: digest
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: tstamp
> > > dest: tstamp
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: url
> > > dest: id
> > > 2011-07-13 11:09:47,690 INFO  solr.SolrMappingReader - source: url
> > > dest: url
> > > 2011-07-13 11:09:49,272 WARN  mapred.LocalJobRunner - job_local_0001
> > > java.lang.RuntimeException: Invalid version or the data in not in
> > > 'javabin' format
> > > 
> > >        at
> > > 
> > > org.apache.solr.common.util.JavaBinCodec.unmarshal(JavaBinCodec.java:99
> > > )
> > > 
> > >        at
> > > 
> > > org.apache.solr.client.solrj.impl.BinaryResponseParser.processResponse(
> > > BinaryResponseParser.java:39)
> > > 
> > >        at
> > > 
> > > org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(Commons
> > > HttpSolrServer.java:466)
> > > 
> > >        at
> > > 
> > > org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(Commons
> > > HttpSolrServer.java:243)
> > > 
> > >        at
> > > 
> > > org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(Abst
> > > ractUpdateRequest.java:105)
> > > 
> > >        at
> > > 
> > > org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
> > > 
> > >        at
> > > 
> > > org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:64)
> > > 
> > >        at org.apache.nutch.indexer.IndexerOutputFormat
> > > 
> > > $1.write(IndexerOutputFormat.java:54)
> > > 
> > >        at org.apache.nutch.indexer.IndexerOutputFormat
> > > 
> > > $1.write(IndexerOutputFormat.java:44)
> > > 
> > >        at org.apache.hadoop.mapred.ReduceTask
> > > 
> > > $3.collect(ReduceTask.java:440)
> > > 
> > >        at
> > > 
> > > org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:
> > > 159)
> > > 
> > >        at
> > > 
> > > org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:
> > > 50)
> > > 
> > >        at
> > > 
> > > org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:463)
> > > 
> > >        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
> > >        at org.apache.hadoop.mapred.LocalJobRunner
> > > 
> > > $Job.run(LocalJobRunner.java:216)
> > > 2011-07-13 11:09:49,611 ERROR solr.SolrIndexer - java.io.IOException:
> > > Job failed!
> > > -----------------------------------------------------------------------
> > > -----------------------------------------------------------------------
> > > -
> > > 
> > > I'd appreciate any help with this.
> > > 
> > > Thanks,
> > > 
> > > Leo

Reply via email to