The AND default has one big problem. If the user misspells a single word, they 
get no results. About 10% of queries are misspelled, so that means a lot more 
failures.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)


On Jul 15, 2015, at 7:21 AM, Jack Krupansky <jack.krupan...@gmail.com> wrote:

> It is simply precision (AND) vs. recall (OR) - the former tries to limit
> the total result count, while the latter tries to focus on relevancy of the
> top results even if the total result count is higher.
> 
> Recall is good for discovery and browsing, where you sort of know what you
> generally want, but not exactly with any great precision.
> 
> Recall will include results that almost meet the query terms, but maybe
> some are missing.
> 
> Precision will guarantee and insist that all query terms are present.
> 
> One great example for recall is a plagiarism query - enter all the terms
> for a passage and then find documents that most closely approximate the
> passage without being necessarily exact matches. IOW, the plagiarizer
> changes a word here and there.
> 
> -- Jack Krupansky
> 
> On Wed, Jul 15, 2015 at 8:16 AM, Steven White <swhite4...@gmail.com> wrote:
> 
>> Hi Everyone,
>> 
>> Out-of-the box, Solr (Lucene?) is set to use OR as the default Boolean
>> operator.  Can someone tell me the advantages / disadvantages of using OR
>> or AND as the default?
>> 
>> I'm leaning toward AND as the default because the more words a user types,
>> the narrower the result set should be.
>> 
>> Thanks
>> 
>> Steve
>> 

Reply via email to