Re: Solr search on a large text field is very slow

2013-08-08 Thread Erick Erickson
If you really need wildcards on both ends, I don't think edgengram will work, you need plain ngrams. The trick is that the query side needs to make the 2-grams into phrases. BTW, I think you'd be fine with bigrams. Best Erick On Thu, Aug 8, 2013 at 1:47 PM, meena.sri...@mathworks.com < meena

Re: Solr search on a large text field is very slow

2013-08-08 Thread meena.sri...@mathworks.com
Thanks for your responses, helped me to understand the issue. Digged through the documentation and now I am implementing EdgeNGramFilterFactory to see how fastly can I improve wild card searches.

Re: Solr search on a large text field is very slow

2013-08-08 Thread Aloke Ghoshal
Compare timings in the following cases: - Without the wildcard - With suffix wild card only - test* - With reverse wild card filter factory and two separate terms - *test OR test* On Thu, Aug 8, 2013 at 8:15 PM, meena.sri...@mathworks.com < meena.sri...@mathworks.com> wrote: > Index size is arou

Re: Solr search on a large text field is very slow

2013-08-08 Thread Rafał Kuć
Hello! In general, wildcard queries can be expensive. If you need wildcard queries, you can try the EdgeNGram - http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.EdgeNGramFilterFactory -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - ElasticSearch > In

Re: Solr search on a large text field is very slow

2013-08-08 Thread Erick Erickson
Sometimes bolding comes through my e-mail as *, so is *test* with the asterisk on each end really what you're doing? Assuming so, this will inevitably be slow. It must iterate through all the terms in the field to see if any of them match. This is generally a bad practice. You can to go n-grams if