On Apr 3, 2010, at 10:18 AM, MitchK wrote:

> 
> Hello,
> 
> I want to tinker a little bit with Solr, so I need a little feedback:
> Is it possible to define a Minimum Should Match for the document itself?
> 
> I mean, it is possible to say that a query "this is my query" should only
> match a document if the document matches three of the four queried terms.
> 
> However, I am looking for a solution that does something like this: for the
> query "this is my query", the document has to consist of this query plus at
> most, for example, two other terms.
> 
> Example:
> Query: "this is my query"
> Doc1: "this is my favorite query"
> Doc2: "I am searching for a lot of stuff, so this is my query"
> Doc3: "I'd like to say: this is my query"
> 
> If at most two other terms may occur in the document, Solr
> should return only Doc1.
> If this is not possible out-of-the-box, I think one has to work with
> TermVectors, am I right?

Not quite following.  It sounds like you are saying you want to favor docs that 
are shorter, while still maximizing the number of terms that match, right?

You might look at the Similarity class and the SimilarityFactory as well in the 
Solr/Lucene code.
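
As a rough illustration of why Similarity is relevant here: the classic Lucene scoring model applies a length normalization of 1/sqrt(numTerms) per field, so a shorter field gets a larger factor. The sketch below only reproduces that arithmetic in plain Java; the class name LengthNormSketch is hypothetical and nothing in it calls Lucene or Solr. Note that this influences ranking only. It does not exclude longer documents from the result.

```java
public class LengthNormSketch {

    // Assumed formula: classic length normalization, 1 / sqrt(number of terms in the field).
    static double lengthNorm(int numTerms) {
        return 1.0 / Math.sqrt(numTerms);
    }

    public static void main(String[] args) {
        // Doc1 from the example above has 5 terms, Doc2 has 13 terms.
        System.out.println("5 terms:  " + lengthNorm(5));
        System.out.println("13 terms: " + lengthNorm(13));
    }
}
```

With the same matching terms, the 5-term document receives a larger factor than the 13-term document, which is the "favor shorter docs" behavior described above.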

> 
> I think it's possible to do so outside of Lucene/Solr by taking the response
> of the TermVectorsComponent and filtering the result-list. But I'd like to
> integrate this into Lucene/Solr itself.
> Any ideas which components I have to customize? 
> 
> At the moment I suspect that I have to customize the class that
> collects the results before they are passed to the ResponseWriter.
> 
> Kind regards
> - Mitch
> -- 
> View this message in context: 
> http://n3.nabble.com/Minimum-Should-Match-the-other-way-round-tp694867p694867.html
> Sent from the Solr - User mailing list archive at Nabble.com.
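
The client-side filtering idea from the quoted message could look roughly like the sketch below. It is a minimal illustration in plain Java, assuming the document text (or its term list, for example taken from a term vector response) is already available on the client. The class ExtraTermFilter, its methods, and the naive tokenizer are all hypothetical and are not part of Lucene or Solr; a real setup would have to use the same analysis chain as the indexed field.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class ExtraTermFilter {

    // Naive tokenizer (assumption): lowercase, split on anything that is
    // not a letter, digit, or apostrophe.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}']+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Counts the document tokens that are not query terms.
    static int extraTerms(String query, String doc) {
        Set<String> queryTerms = new HashSet<>(tokenize(query));
        int extra = 0;
        for (String t : tokenize(doc)) {
            if (!queryTerms.contains(t)) {
                extra++;
            }
        }
        return extra;
    }

    // Keeps documents that contain every query term and at most maxExtra other terms.
    static List<String> filter(String query, List<String> docs, int maxExtra) {
        List<String> queryTerms = tokenize(query);
        List<String> kept = new ArrayList<>();
        for (String doc : docs) {
            Set<String> docTerms = new HashSet<>(tokenize(doc));
            if (docTerms.containsAll(queryTerms) && extraTerms(query, doc) <= maxExtra) {
                kept.add(doc);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList(
                "this is my favorite query",
                "I am searching for a lot of stuff, so this is my query",
                "I'd like to say: this is my query");
        // Prints: [this is my favorite query]
        System.out.println(filter("this is my query", docs, 2));
    }
}
```

With a limit of two extra terms, only the first document from the example survives: it has one extra term, while the other two have nine and four.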
