Re: What filter to use to search with spaces omitted/included between words?

Erick Erickson Tue, 20 Aug 2013 04:22:06 -0700

Also consider WordDelimiterFilterFactory, which will break up
tokens on upper/lower case transitions (e.g. BestBuy becomes Best, Buy).
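
A minimal sketch of a field type using it, assuming a hypothetical
field type name (text_wdf) and illustrative attribute values; adjust
to your own schema.xml:

```xml
<!-- "text_wdf" is a hypothetical name used only for this example -->
<fieldType name="text_wdf" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Split on whitespace only, so mixed-case tokens reach the filter intact -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- splitOnCaseChange="1" turns "BestBuy" into "Best" and "Buy";
         catenateWords="1" additionally emits the joined form "BestBuy" -->
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="1"
            splitOnCaseChange="1"
            catenateWords="1"/>
    <!-- Lowercase after the delimiter filter, otherwise the case
         transitions are gone before it can see them -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Note that this only helps when the text actually contains a case
transition; an all-lowercase "bestbuy" passes through as one token.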

To get better relevance ranking, consider edismax-style query parsers
and adding automatic phrase generation (usually with boosts).
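
As a rough sketch, the phrase boost can be set as request handler
defaults in solrconfig.xml. The field name "name" and the boost value
are assumptions for illustration, not taken from the original question:

```xml
<requestHandler name="/select" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="defType">edismax</str>
    <!-- qf: fields the individual query terms are matched against
         ("name" is a hypothetical field) -->
    <str name="qf">name</str>
    <!-- pf: the whole user query is also tried as a phrase on this field,
         and documents matching the phrase get the extra boost -->
    <str name="pf">name^10</str>
    <!-- ps: slop allowed for the generated phrase query -->
    <str name="ps">0</str>
  </lst>
</requestHandler>
```

With this, q=best buy still matches documents containing either term,
but documents containing the two words next to each other score higher.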

This one will be a problem:
q=bestbuy

There's no good generic way to get this to split up. One
possibility is to use synonyms if the list is known, but
otherwise there's no information here to distinguish it
from "legitimate" words.
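
If the list is known, a sketch of the synonym approach could look like
the following. The field type name and the synonyms.txt entry are
hypothetical, and the synonym filter is applied at index time only,
since multi-word synonyms tend to misbehave at query time:

```xml
<!-- "text_syn" is a hypothetical name used only for this example -->
<fieldType name="text_syn" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Assumes synonyms.txt contains a line such as:
           bestbuy, best buy
         With expand="true" both forms are indexed for either input -->
    <filter class="solr.SynonymFilterFactory"
            synonyms="synonyms.txt"
            ignoreCase="true"
            expand="true"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Changing synonyms.txt requires reindexing with this setup, because the
expansion happens when documents are indexed.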

EdgeNGrams work on _tokens_, not words, so I doubt
they would help in this case either, since there is only
one token.
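
To illustrate, here is a sketch of an edge n-gram analyzer with
hypothetical names and gram sizes:

```xml
<!-- "text_edge" is a hypothetical name used only for this example -->
<fieldType name="text_edge" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- For the single token "bestbuy" this emits only prefixes:
         be, bes, best, bestb, bestbu, bestbuy.
         "buy" is never produced, so the token is not split into words -->
    <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Going the other way, a document containing "best buy" is indexed as
prefixes of "best" and of "buy", none of which equals "bestbuy".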

Best
Erick

On Tue, Aug 20, 2013 at 3:16 AM, tamanjit.bin...@yahoo.co.in <
tamanjit.bin...@yahoo.co.in> wrote:

> Additionally, if you don't want results like q=best and result=bestbuy, you
> can use <charFilter class="solr.PatternReplaceCharFilterFactory"
> pattern="\W+" replacement=""/> to strip whitespace (and any other
> non-word characters) before tokenizing.
>
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#CharFilterFactories
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/What-filter-to-use-to-search-with-spaces-omitted-included-between-words-tp4085576p4085601.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
