I totally missed EdgeNGram. Good catch, Alex!

Yeah, that's a killer. My shot in the dark here is that
your analysis chain isn't the best choice to support your use-case and you're
shooting yourself in the foot. So let's back up and talk
about your use-case and maybe re-define your analysis
chain for better performance.
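
To put a rough number on what Alex is describing, here's a minimal
Python sketch (not Solr code; edge_ngrams and the sample value are
made up for illustration) of what an edge n-gram filter with
minGramSize=3 and maxGramSize=50 should emit for one token. Since
KeywordTokenizer turns each whole value into a single token, any
value of 50 or more characters would become 48 indexed terms:

```python
def edge_ngrams(token, min_gram=3, max_gram=50):
    """Return the leading-edge n-grams of token, shortest first.

    Tokens shorter than min_gram yield nothing; tokens longer than
    max_gram stop growing at max_gram characters.
    """
    upper = min(len(token), max_gram)
    return [token[:n] for n in range(min_gram, upper + 1)]


# Hypothetical field value, lowercased the way the index chain would do it.
value = "Massachusetts Institute of Technology".lower()
grams = edge_ngrams(value)

print(grams[:3])                    # ['mas', 'mass', 'massa']
print(len(grams))                   # one term per prefix length from 3 up
print(len(edge_ngrams("x" * 60)))   # 48: prefix lengths 3 through 50
```

Multiply that by the number of values per document and the number of
documents and the term count adds up quickly. The Analysis page in
the Admin UI will show the real output for your data.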

Best,
Erick

On Thu, Sep 22, 2016 at 8:21 AM, Alexandre Rafalovitch
<arafa...@gmail.com> wrote:
> Well,
>
> I am guessing this is the line that's causing the problem:
> <filter class="solr.EdgeNGramFilterFactory" minGramSize="3" maxGramSize="50"/>
>
> Run your real sample for that field against your indexing definition
> in Admin UI and see how many tokens you end up with. You may have 50
> tokens, but if each of them generates up to 48 representations
> (prefix lengths 3 through 50)...
>
> Regards,
>     Alex.
> ----
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
>
>
> On 22 September 2016 at 22:08, slee <sleed...@gmail.com> wrote:
>> Here's what I have defined in my schema:
>> <fieldType name="c_text" class="solr.TextField" positionIncrementGap="100">
>>     <analyzer type="index">
>>       <tokenizer class="solr.KeywordTokenizerFactory"/>
>>       <filter class="solr.LowerCaseFilterFactory"/>
>>       <filter class="solr.ASCIIFoldingFilterFactory"/>
>>       <filter class="solr.EdgeNGramFilterFactory" minGramSize="3" maxGramSize="50"/>
>>     </analyzer>
>>     <analyzer type="query">
>>       <tokenizer class="solr.KeywordTokenizerFactory"/>
>>       <filter class="solr.LowerCaseFilterFactory"/>
>>       <filter class="solr.ASCIIFoldingFilterFactory"/>
>>     </analyzer>
>>   </fieldType>
>>
>> <field name="global_Value" type="c_text" multiValued="true" indexed="true" required="true" stored="true"/>
>>
>> This is what I send in the query (2 values):
>> q=global_Value:*mas+AND+global_Value:*sef&df=text&rows=5&version=2.2&echoParams=explicit&fl=global_Value
>>
>> In addition, memory usage is way over 90%, with the heap size set to 5g.
>>
>>
>>
>>
>> --
>> View this message in context: 
>> http://lucene.472066.n3.nabble.com/Performance-Issue-when-querying-Multivalued-fields-SOLR-6-1-0-tp4297255p4297474.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
