What do you want actual user queries to look like? I mean, having to
explicitly write asterisks after every term is a real pain.
Indexing ngrams has the advantage that phrase queries and edismax phrase
boosting work automatically. Phrases don't work with explicit wildcard
queries.
The only real d
Try it and see ;).
My experience is that wildcards work fine, although
what "fine" is up to you to decide _if_ you restrict
it to requiring at least two leading "real" characters,
and I actually prefer three. I.e.
ab* or abc*. Note that if you require leading
wildcards, use the reverse wildcard fi
On 6/27/2015 4:27 AM, octopus wrote:
> Hi, I'm looking at Solr's features for wildcard search used for a large
> amount of text. I read on the net that solr.EdgeNGramFilterFactory is used
> to generate tokens for wildcard searching.
>
> For Nigerian => "ni", "nig", "nige", "niger", "nigeri", "nig
That is one way to implement wildcarda, but isnt the most efficient.
Just index normally, tokenized, and search with an asterisk suffix, e.g.
foo*
This will build a finite state transformer that will make wildcard
handling efficient.
Upayavira
On, Jun 27, 2015, at 11:27 AM, pus wrote:
> Hi, I'm