Re: Solr Wildcard Search for large amount of text

2015-06-27 Thread Jack Krupansky
What do you want actual user queries to look like? I mean, having to explicitly write asterisks after every term is a real pain. Indexing ngrams has the advantage that phrase queries and edismax phrase boosting work automatically. Phrases don't work with explicit wildcard queries. The only real d

Re: Solr Wildcard Search for large amount of text

2015-06-27 Thread Erick Erickson
Try it and see ;). My experience is that wildcards work fine, although what "fine" is up to you to decide _if_ you restrict it to requiring at least two leading "real" characters, and I actually prefer three. I.e. ab* or abc*. Note that if you require leading wildcards, use the reverse wildcard fi

Re: Solr Wildcard Search for large amount of text

2015-06-27 Thread Shawn Heisey
On 6/27/2015 4:27 AM, octopus wrote: > Hi, I'm looking at Solr's features for wildcard search used for a large > amount of text. I read on the net that solr.EdgeNGramFilterFactory is used > to generate tokens for wildcard searching. > > For Nigerian => "ni", "nig", "nige", "niger", "nigeri", "nig

Re: Solr Wildcard Search for large amount of text

2015-06-27 Thread Upayavira
That is one way to implement wildcarda, but isnt the most efficient. Just index normally, tokenized, and search with an asterisk suffix, e.g. foo* This will build a finite state transformer that will make wildcard handling efficient. Upayavira On, Jun 27, 2015, at 11:27 AM, pus wrote: > Hi, I'm