What do you want actual user queries to look like? I mean, having to explicitly write asterisks after every term is a real pain.
Indexing ngrams has the advantage that phrase queries and edismax phrase boosting work automatically. Phrases don't work with explicit wildcard queries. The only real downside to ngrams is that they explode the size of the index. But memory is supposed to be cheap these days. I mean, compare the cost of the extra RAM (to keep the full index in memory) to the cost to users of tehir productivity constructing queries and having expensive staff to help them figure out why various queries don't work as expected. How big is your corpus - number of documents and average document size? -- Jack Krupansky On Sat, Jun 27, 2015 at 6:27 AM, octopus <octroll...@gmail.com> wrote: > Hi, I'm looking at Solr's features for wildcard search used for a large > amount of text. I read on the net that solr.EdgeNGramFilterFactory is used > to generate tokens for wildcard searching. > > For Nigerian => "ni", "nig", "nige", "niger", "nigeri", "nigeria", > "nigeria", "nigerian" > > However, I have a large amount of text out there which requires wildcard > search and it's not viable to use EdgeNGrameFilterFactory as the amount of > processing will be too huge. Do you have any suggestions/advice please? > > Thank you so much for your time! > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-Wildcard-Search-for-large-amount-of-text-tp4214392.html > Sent from the Solr - User mailing list archive at Nabble.com. >