Tony, I think you should ignore the advice/code from foofactory blog and just go with NUTCH-442, as that's most likely going to result in "the official" Nutch-Solr integration.
Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: Tony Wang <ivyt...@gmail.com> > To: solr-user@lucene.apache.org > Sent: Saturday, December 27, 2008 2:46:12 PM > Subject: Re: Please help me integrate Nutch with Solr > > Hi Dingding, > > Thanks for the message. I followed this tutorial here > http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html, > trying to integrate Nutch with Solr. Right now, my Nutch installation is > successful (see http://208.64.71.46:8080, use keywords like DNA or protein, > etc) > > I am not sure if I understand you right. Should I apply that patch from the > link you gave me also? I use the latest nightly builds of Nutch and Solr. > The jsp servlet container is Tomcat6 on my CentOS 5.2 box. > > Thanks again. > > tony > > On Sat, Dec 27, 2008 at 12:38 PM, Dingding Ye wrote: > > > Hi, Wang. > > > > Solr should be act as the backend indexing server for Nutch instead of the > > lucene index. > > > > You have to patch the nutch at first. > > > > Check this list: > > > > > > > https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12607005#action_12607005 > > > > On Sun, Dec 28, 2008 at 3:31 AM, Tony Wang wrote: > > > > > yes, I kind of jumped in the middle of the learning curve which seems to > > be > > > rather deep to me. > > > > > > I've tried to find/read many tutorials so far, it seems to me that data > > is > > > added to Solr via XML documents. I wonder how to convert Nutch's indexing > > > data to XML that could be added to Solr. Or is there other simpler ways > > of > > > achieving the conversion? > > > > > > I just watched this slide > > > http://www.slideshare.net/pittaya/using-apache-solr > > > > > > Thanks! > > > > > > Tony > > > > > > On Fri, Dec 26, 2008 at 11:25 AM, Otis Gospodnetic < > > > otis_gospodne...@yahoo.com> wrote: > > > > > > > Tony, > > > > > > > > You started an example Solr application. This application has no > > > knowledge > > > > of your Nutch setup and the Lucene index that you've presumably built > > > with > > > > Nutch. The simplest thing you can do to get something to "work" is to > > > > locate your Lucene/Nutch index and copy all of its contents to > > > > solr/example/solr/data/index/ directory. You will need to adjust the > > > schema > > > > to match the Lucene/Nutch index fields, too. > > > > > > > > But honestly, it looks like you are "starting from the middle" without > > > > really following things step-by-step and without really understanding > > > either > > > > Nutch or Solr. My suggestion is to first play only with Nutch and > > learn > > > how > > > > to run various Nutch steps, all the way to generating an index. Then > > > play > > > > with Solr (and forget about Nutch) by following the Solr tutorial. > > Once > > > you > > > > get Solr by itself working, you will understand how the Nutch and Solr > > > > combination can work together. > > > > > > > > Otis > > > > -- > > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > > > > > > > > > > > > > ----- Original Message ---- > > > > > From: Tony Wang > > > > > To: solr-user@lucene.apache.org > > > > > Sent: Friday, December 26, 2008 12:10:39 PM > > > > > Subject: Re: Please help me integrate Nutch with Solr > > > > > > > > > > Thanks for the reply, I appreciate it. > > > > > > > > > > Basically, I downloaded the latest nightly build of Solr --> unzip it > > > --> > > > > > shut down Tomcat -->copy the solr.war file to the Tomcat webapps > > > > container > > > > > (/opt/tomcat6/webapps) --> start Tomcat --> set up the schema.xml > > file > > > > > located in /opt/tomcat6/webapps/solr/example/solr/conf. > > > > > > > > > > After all of these above operations, I go to the directory > > > > > /opt/tomcat6/webapps/solr/example and run 'java -jar start.jar' and > > the > > > > port > > > > > 8983 is opened. See my solr in action http://208.64.71.46:8983/solr/ > > , > > > > you > > > > > can do a search from there or in the admin panel. > > > > > > > > > > If what I did was wrong, could you please give me some instruction? I > > > > > appreciate it! > > > > > > > > > > Happy holidays! > > > > > > > > > > Tony > > > > > > > > > > > > > > > On Fri, Dec 26, 2008 at 9:45 AM, Otis Gospodnetic < > > > > > otis_gospodne...@yahoo.com> wrote: > > > > > > > > > > > Hi Tony, > > > > > > > > > > > > It looks like you already asked on the Nutch list. Maybe just > > stick > > > to > > > > one > > > > > > list? > > > > > > It's impossible to tell what went wrong from the information you > > > > provided > > > > > > below. Are there any errors in the log? Are you sure your solr > > home > > > > is set > > > > > > correctly? > > > > > > > > > > > > Otis > > > > > > -- > > > > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > > > > > > > > > > > > > > > > > > > > > ----- Original Message ---- > > > > > > > From: Tony Wang > > > > > > > To: solr-user@lucene.apache.org > > > > > > > Sent: Friday, December 26, 2008 11:20:06 AM > > > > > > > Subject: Please help me integrate Nutch with Solr > > > > > > > > > > > > > > I got the web interface to work at here > > > > > > > http://208.64.71.46:8080/search.jsp?lang=en&query=tony , but my > > > > search > > > > > > query > > > > > > > in Solr doesn't seem to work correctly. There is no any result > > > > retrieved > > > > > > > from > > > > > > > > > > > > > > > > > > > > > > > > > > > > http://208.64.71.46:8983/solr/select?q=tony&start=0&rows=10&fl=title%2Curl%2Cscore&qt=standard&wt=standard&hl=on&hl.fl=content > > > > > > > > > > > > > > I followed this guide to integrate Nutch with Solr > > > > > > > > > > > > > > > > > > > > > > http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html > > > > > > . > > > > > > > I wonder what could be wrong with my integration. > > > > > > > > > > > > > > I use CentOS 5.2, Tomcat6 and Nutch & Solr latest nightly builds. > > > > > > > > > > > > > > Thanks! > > > > > > > > > > > > > > Tony > > > > > > > > > > > > > > -- > > > > > > > Signature: Success is a journey that never ends. > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > Signature: Success is a journey that never ends. > > > > > > > > > > > > > > > > > -- > > > Signature: Success is a journey that never ends. > > > > > > > > > -- > Signature: Success is a journey that never ends.