Re: facets and stopwords

JCodina Wed, 08 Jul 2009 04:58:16 -0700

hossman wrote:
> 
> 
> but are you sure that example would actually cause a problem?
> i suspect if you index thta exact sentence as is you wouldn't see the 
> facet count for "si" or "que" increase at all.
> 
> If you do a query for "{!raw field=content}que" you bypass the query 
> parsers (which is respecting your stopwords file) and see all docs that 
> contain the raw term "que" in the content field.
> 
> if you look at some of the docs that match, and paste their content field 
> into the analysis tool, i think you'll see that the problem comes from 
> using the whitespace tokenizer, and is masked by using the WDF 
> after the stop filter ... things like "Que?" are getting ignored by the 
> stopfilter, but ultimately winding up in your index as "que"
> 
> 
> -Hoss
> 
> 

Yes your are right, que? que, que... i need to change the analyzer. They are
not detected by the stopwords analyzer because i use the whitespace
tokenizer, I will use the StanadardTokenizer

Thanks Hoss

-- 
View this message in context: 
http://www.nabble.com/facets-and-stopwords-tp23952823p24390157.html
Sent from the Solr - User mailing list archive at Nabble.com.
Re: facets and stopwords

Reply via email to