Re: stopwords and phrase queries

2008-03-25 Thread Vinci
Hi, I think Solr allow you to do asymmetric query processing and indexing.(*Not all the preprocessing can be asymmetric - stemming, lowercasing must be symmetric) To make the query work, at least you need to make the stop words to be indexed and then the query should not do the stop word removal

Re: stopwords and phrase queries

2008-03-25 Thread Sean Timm
Music is another domain where this is a real problem. E.g., "The The", "The Who", not to mention the song and album names. -Sean Walter Underwood wrote: We do a similar thing with a no stopword, no stemming field. There are a surprising number of movie titles that are entirely stopwords. "Be

Re: stopwords and phrase queries

2008-03-21 Thread Walter Underwood
We do a similar thing with a no stopword, no stemming field. There are a surprising number of movie titles that are entirely stopwords. "Being There" was the first one I noticed, but "To be and to have" wins the prize for being all-stopwords in two languages. See my list, here: http://wunderwood

RE: stopwords and phrase queries

2008-03-21 Thread Lance Norskog
Yes. Our in-house example is the movie title "The Sound Of Music". Given in quotes as a phrase this will pull up "anystopword Sound anystopword Music". For example, "A Sound With Music". Your example is also a test case of ours. For some Lucenicious reason "six stopwords in a row" does not find an