Re: solr nutch url indexing

2009-08-26 Thread Uri Boness
Do you mean the schema or the solrconfig.xml? The request handler is configured in the solrconfig.xml and you can find out more about this particular configuration in http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=(CategorySolrRequestHandler)|((CategorySolrRequestHandler)). To u

Re: solr nutch url indexing

2009-08-25 Thread last...@gmail.com
Uri Boness wrote: Well... yes, it's a tool that Nutch ships with. It also ships with an example Solr schema which you can use. Hi, is there any documentation to understand what is going on in the schema? dismax explicit 0.01 content0.5 anchor1.0 title5.2 cont

Re: solr nutch url indexing

2009-08-25 Thread Uri Boness
Well... yes, it's a tool that Nutch ships with. It also ships with an example Solr schema which you can use. Fuad Efendi wrote: Thanks for the link, so, SolrIndex is NOT a plugin, it is an application... I use a similar approach... -Original Message- From: Uri Boness Hi, Nutch comes with

RE: solr nutch url indexing

2009-08-25 Thread Fuad Efendi
Thanks for the link, so, SolrIndex is NOT a plugin, it is an application... I use a similar approach... -Original Message- From: Uri Boness Hi, Nutch comes with support for Solr out of the box. I suggest you follow the steps as described here: http://www.lucidimagination.com/blog/2009/03/0

Re: solr nutch url indexing

2009-08-25 Thread Uri Boness
It seems to me that this configuration actually does what you want: it queries mostly on "title". The default search field doesn't influence a dismax query. I suggest including the debugQuery=true parameter; it will help you figure out how the matching is performed. You can read more
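
As a rough illustration of the suggestion above, the following Python sketch builds a dismax request URL that weights "title" above "content" and turns on debugQuery=true. The parameters defType, qf, fl and debugQuery are standard Solr request parameters; the host, port, path, query text and boost values are assumptions made up for this example and are not taken from the thread.

```python
from urllib.parse import urlencode


def build_dismax_url(base_url, user_query, field_boosts, debug=True):
    """Return a Solr select URL that runs user_query through the dismax parser.

    field_boosts maps a field name to its boost, e.g. {"title": 5.0}.
    """
    params = {
        "q": user_query,
        "defType": "dismax",
        # qf is a space-separated list of field^boost pairs.
        "qf": " ".join(f"{field}^{boost}" for field, boost in field_boosts.items()),
        "fl": "url,title,score",
    }
    if debug:
        # Asks Solr to include the parsed query and score explanations.
        params["debugQuery"] = "true"
    return f"{base_url}?{urlencode(params)}"


# Hypothetical local Solr instance; adjust to your own installation.
url = build_dismax_url(
    "http://localhost:8983/solr/select",
    "apache lucene",
    {"title": 5.0, "content": 0.5},
)
print(url)
```

Opening the printed URL in a browser (or fetching it with any HTTP client) should return the usual result list plus a debug section showing how each field contributed to the score.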

Re: solr nutch url indexing

2009-08-25 Thread Thibaut Lassalle
Thanks for your help. I use the default Nutch configuration and I use solrindex to give the Nutch result to Solr. I get results when I query, so Nutch works properly (it gives a url, title, content ...). I would like to query Solr so that it emphasizes the "title" field rather than the "content" field.

Re: solr nutch url indexing

2009-08-24 Thread Uri Boness
[mailto:ubon...@gmail.com] Sent: August-24-09 4:42 PM To: solr-user@lucene.apache.org Subject: Re: solr nutch url indexing How did you configure nutch? Make sure you have the "parse-html" and "index-basic" configured. The HtmlParser should by default extract the page title and add

RE: solr nutch url indexing

2009-08-24 Thread Fuad Efendi
Is SolrIndex a plugin for Nutch? Thanks! -Original Message- From: Uri Boness [mailto:ubon...@gmail.com] Sent: August-24-09 4:42 PM To: solr-user@lucene.apache.org Subject: Re: solr nutch url indexing How did you configure nutch? Make sure you have the "parse-html" and "

Re: solr nutch url indexing

2009-08-24 Thread Uri Boness
How did you configure nutch? Make sure you have the "parse-html" and "index-basic" configured. The HtmlParser should by default extract the page title and add it to the parsed data, and the BasicIndexingFilter by default adds this title to the NutchDocument and stores it in the "title" field. All
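
One way to check the plugin configuration mentioned above is sketched below in Python. It assumes that the plugins are enabled through the plugin.includes property of a Nutch configuration file (nutch-site.xml or nutch-default.xml) and that the property value is a regular expression matched against whole plugin ids. The sample XML and the helper name enabled_plugins are invented for this example.

```python
import re
import xml.etree.ElementTree as ET

# Invented sample; a real nutch-site.xml will list different plugins.
SAMPLE_CONFIG = """<configuration>
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(text|html)|index-basic|query-(basic|site|url)</value>
  </property>
</configuration>"""


def enabled_plugins(config_xml, wanted):
    """Map each plugin id in wanted to True if plugin.includes matches it."""
    root = ET.fromstring(config_xml)
    pattern = None
    for prop in root.iter("property"):
        if prop.findtext("name") == "plugin.includes":
            pattern = prop.findtext("value", default="").strip()
    if pattern is None:
        # Property not set in this file; nothing can be confirmed from it.
        return {name: False for name in wanted}
    regex = re.compile(pattern)
    # Assumption: the whole plugin id must match the pattern.
    return {name: regex.fullmatch(name) is not None for name in wanted}


print(enabled_plugins(SAMPLE_CONFIG, ["parse-html", "index-basic"]))
```

If either id comes back False for your own file, the "title" field is unlikely to reach the NutchDocument, which would explain an empty title in Solr.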