Also consider WordDelimterFilterFactory, which will break up the
tokens on upper/lower case transitions.

to get relevance, consider edismax-style query parsers and adding
automatic phrase generation (with boosts usually).

This one will be a problem:
q=bestbuy

There's no good generic way to get this to split up. One
possibility is to use synonyms if the list is known, but
otherwise there's no information here to distinguish it
from "legitimate" words.

edgeNgrams work on _tokens_, not words so I doubt
they would help in this case either since there is only
one token.

Best
Erick


On Tue, Aug 20, 2013 at 3:16 AM, tamanjit.bin...@yahoo.co.in <
tamanjit.bin...@yahoo.co.in> wrote:

> Additionally, if you dont want results like q=best and result=bestbuy; you
> can use <charFilter class="solr.PatternReplaceCharFilterFactory"
> pattern="\W+" replacement=""/> to actually replace whitespaces with
> nothing.
>
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#CharFilterFactories
> <
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#CharFilterFactories
> >
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/What-filter-to-use-to-search-with-spaces-omitted-included-between-words-tp4085576p4085601.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Reply via email to