Hi Erick, The term "newspaper latimes" is not found in latimes.com. However, google places it in the first place. My guess is that mm parameter must not be set as 2<-1 in order to achieve google.com like ranking for two word phrase queries.
My goal is to set mm parameter in such a way that latimes.com will be ranked in 1-3rd places and sites with both words will be placed after them. As I wrote in my previous letter setting mm as 1<-1 solves this issue partially. Problem in this case is that sites with both words are placed at the bottom or are not in the search results at all. Thanks. Alex. -----Original Message----- From: Erick Erickson <erickerick...@gmail.com> To: solr-user <solr-user@lucene.apache.org> Sent: Sat, Nov 5, 2011 9:01 am Subject: Re: how to achieve google.com like results for phrase queries First, the default query operator is ignored by edismax, so that's not doing anything. Why would you expect "newspaper latimes" to be found at all in "latimes.com"? What proof do you have that the two terms are even in the "latimes.com" document? You can look at the Query Elevation Component to force certain known documents to the top of the results based on the search terms, but that's not a very elegant solution. What business requirement are you trying to accomplish here? Because as asked, there's really not enough information to provide a meaningful suggestion. Best Erick On Thu, Nov 3, 2011 at 7:30 PM, <alx...@aim.com> wrote: > Hello, > > I use nutch-1.3 crawled results in solr-3.4. I noticed that for two word phrases like newspaper latimes, latimes.com is not in results at all. > This may be due to the dismax def type that I use in request handler > > <str name="defType">dismax</str> > <str name="qf">url^1.5 id^1.5 content^ title^1.2</str> > <str name="pf">url^1.5 id^1.5 content^0.5 title^1.2</str> > > > with mm as > <str name="mm">2<-1 5<-2 6<90%</str> > > However, changing it to > <str name="mm">1<-1 2<-1 5<-2 6<90%</str> > > and q.op to OR or AND > > do not solve the problem. In this case latimes.com is ranked higher, but > still is not in the first place. > Also in this case results with both words are ranked very low, almost at the end. > > We need to be able to achieve the case when latimes.com is placed in the > first place then results with both words and etc. > > Any ideas how to modify config to this end? > > Thanks in advance. > Alex. > >