Thanks Otis. I've just downloaded NUTCH-442_v8.patch <https://issues.apache.org/jira/secure/attachment/12391810/NUTCH-442_v8.patch> from https://issues.apache.org/jira/browse/NUTCH-442, but the patching process gave me lots of errors, see below:

*webby88 /opt/tomcat6/webapps/nutch:patch < NUTCH-442_v8.patch*
The next patch would delete the file TestDistributedSearch.java, which does not exist! Assume -R? [n]
Apply anyway? [n] y
can't find file to patch at input line 5
Perhaps you should have used the -p or --strip option?
The text leading up to this was:
--------------------------
|Index: src/test/org/apache/nutch/searcher/TestDistributedSearch.java
|===================================================================
|--- src/test/org/apache/nutch/searcher/TestDistributedSearch.java (revision 701044)
|+++ src/test/org/apache/nutch/searcher/TestDistributedSearch.java (working copy)
--------------------------
File to patch:
Skip this patch? [y] n
File to patch: src/test/org/apache/nutch/searcher/TestDistributedSearch.java
patching file src/test/org/apache/nutch/searcher/TestDistributedSearch.java
*can't find file to patch at input line 154*
Perhaps you should have used the -p or --strip option?
The text leading up to this was:
--------------------------
|Index: src/test/org/apache/nutch/indexer/TestIndexingFilters.java
|===================================================================
|--- src/test/org/apache/nutch/indexer/TestIndexingFilters.java (revision 701044)
|+++ src/test/org/apache/nutch/indexer/TestIndexingFilters.java (working copy)
--------------------------
File to patch: *src/test/org/apache/nutch/indexer/TestIndexingFilters.java*

I wonder whether my way of applying this patch is correct. Could you please correct me if I did something wrong? My system is CentOS 5.2, by the way.

Thanks!

Tony

On Sun, Dec 28, 2008 at 10:18 PM, Otis Gospodnetic <otis_gospodne...@yahoo.com> wrote:

> Tony,
>
> I think you should ignore the advice/code from foofactory blog and just go
> with NUTCH-442, as that's most likely going to result in "the official"
> Nutch-Solr integration.
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> ----- Original Message ----
> > From: Tony Wang <ivyt...@gmail.com>
> > To: solr-user@lucene.apache.org
> > Sent: Saturday, December 27, 2008 2:46:12 PM
> > Subject: Re: Please help me integrate Nutch with Solr
> >
> > Hi Dingding,
> >
> > Thanks for the message. I followed this tutorial here
> > http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html ,
> > trying to integrate Nutch with Solr. Right now, my Nutch installation is
> > successful (see http://208.64.71.46:8080, use keywords like DNA or protein,
> > etc)
> >
> > I am not sure if I understand you right. Should I apply that patch from the
> > link you gave me also? I use the latest nightly builds of Nutch and Solr.
> > The jsp servlet container is Tomcat6 on my CentOS 5.2 box.
> >
> > Thanks again.
> >
> > tony
> >
> > On Sat, Dec 27, 2008 at 12:38 PM, Dingding Ye wrote:
> >
> > > Hi, Wang.
> > >
> > > Solr should be act as the backend indexing server for Nutch instead of the
> > > lucene index.
> > >
> > > You have to patch the nutch at first.
> > >
> > > Check this list:
> > >
> > > https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12607005#action_12607005
> > >
> > > On Sun, Dec 28, 2008 at 3:31 AM, Tony Wang wrote:
> > >
> > > > yes, I kind of jumped in the middle of the learning curve which seems to be
> > > > rather deep to me.
> > > >
> > > > I've tried to find/read many tutorials so far, it seems to me that data is
> > > > added to Solr via XML documents. I wonder how to convert Nutch's indexing
> > > > data to XML that could be added to Solr. Or is there other simpler ways of
> > > > achieving the conversion?
> > > >
> > > > I just watched this slide
> > > > http://www.slideshare.net/pittaya/using-apache-solr
> > > >
> > > > Thanks!
> > > >
> > > > Tony
> > > >
> > > > On Fri, Dec 26, 2008 at 11:25 AM, Otis Gospodnetic <otis_gospodne...@yahoo.com> wrote:
> > > >
> > > > > Tony,
> > > > >
> > > > > You started an example Solr application. This application has no knowledge
> > > > > of your Nutch setup and the Lucene index that you've presumably built with
> > > > > Nutch. The simplest thing you can do to get something to "work" is to
> > > > > locate your Lucene/Nutch index and copy all of its contents to
> > > > > solr/example/solr/data/index/ directory. You will need to adjust the schema
> > > > > to match the Lucene/Nutch index fields, too.
> > > > >
> > > > > But honestly, it looks like you are "starting from the middle" without
> > > > > really following things step-by-step and without really understanding either
> > > > > Nutch or Solr. My suggestion is to first play only with Nutch and learn how
> > > > > to run various Nutch steps, all the way to generating an index. Then play
> > > > > with Solr (and forget about Nutch) by following the Solr tutorial. Once you
> > > > > get Solr by itself working, you will understand how the Nutch and Solr
> > > > > combination can work together.
> > > > >
> > > > > Otis
> > > > > --
> > > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> > > > >
> > > > >
> > > > >
> > > > > ----- Original Message ----
> > > > > > From: Tony Wang
> > > > > > To: solr-user@lucene.apache.org
> > > > > > Sent: Friday, December 26, 2008 12:10:39 PM
> > > > > > Subject: Re: Please help me integrate Nutch with Solr
> > > > > >
> > > > > > Thanks for the reply, I appreciate it.
> > > > > >
> > > > > > Basically, I downloaded the latest nightly build of Solr --> unzip it -->
> > > > > > shut down Tomcat -->copy the solr.war file to the Tomcat webapps container
> > > > > > (/opt/tomcat6/webapps) --> start Tomcat --> set up the schema.xml file
> > > > > > located in /opt/tomcat6/webapps/solr/example/solr/conf.
> > > > > >
> > > > > > After all of these above operations, I go to the directory
> > > > > > /opt/tomcat6/webapps/solr/example and run 'java -jar start.jar' and the port
> > > > > > 8983 is opened. See my solr in action http://208.64.71.46:8983/solr/ , you
> > > > > > can do a search from there or in the admin panel.
> > > > > >
> > > > > > If what I did was wrong, could you please give me some instruction? I
> > > > > > appreciate it!
> > > > > >
> > > > > > Happy holidays!
> > > > > >
> > > > > > Tony
> > > > > >
> > > > > >
> > > > > > On Fri, Dec 26, 2008 at 9:45 AM, Otis Gospodnetic <otis_gospodne...@yahoo.com> wrote:
> > > > > >
> > > > > > > Hi Tony,
> > > > > > >
> > > > > > > It looks like you already asked on the Nutch list. Maybe just stick to one
> > > > > > > list?
> > > > > > > It's impossible to tell what went wrong from the information you provided
> > > > > > > below. Are there any errors in the log? Are you sure your solr home is set
> > > > > > > correctly?
> > > > > > >
> > > > > > > Otis
> > > > > > > --
> > > > > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > ----- Original Message ----
> > > > > > > > From: Tony Wang
> > > > > > > > To: solr-user@lucene.apache.org
> > > > > > > > Sent: Friday, December 26, 2008 11:20:06 AM
> > > > > > > > Subject: Please help me integrate Nutch with Solr
> > > > > > > >
> > > > > > > > I got the web interface to work at here
> > > > > > > > http://208.64.71.46:8080/search.jsp?lang=en&query=tony , but my search query
> > > > > > > > in Solr doesn't seem to work correctly. There is no any result retrieved
> > > > > > > > from
> > > > > > > >
> > > > > > > > http://208.64.71.46:8983/solr/select?q=tony&start=0&rows=10&fl=title%2Curl%2Cscore&qt=standard&wt=standard&hl=on&hl.fl=content
> > > > > > > >
> > > > > > > > I followed this guide to integrate Nutch with Solr
> > > > > > > >
> > > > > > > > http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html .
> > > > > > > > I wonder what could be wrong with my integration.
> > > > > > > >
> > > > > > > > I use CentOS 5.2, Tomcat6 and Nutch & Solr latest nightly builds.
> > > > > > > >
> > > > > > > > Thanks!
> > > > > > > >
> > > > > > > > Tony
> > > > > > > >
> > > > > > > > --
> > > > > > > > Signature: Success is a journey that never ends.
> > > > > > >
> > > > > >
> > > > > > --
> > > > > > Signature: Success is a journey that never ends.
> > > > >
> > > >
> > > > --
> > > > Signature: Success is a journey that never ends.
> > >
> >
> > --
> > Signature: Success is a journey that never ends.
>

--
Signature: Success is a journey that never ends.