Thanks a lot, Koji; I'll take a deeper look at the tokenizer modifications on my side.
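For anyone following the thread, here is a rough sketch of how I understand the end() point Koji raises in the reply quoted in this message. It is a simplified, self-contained model, not Lucene or Solr code: `FinalOffsetModel`, `tokenize`, and `highlight` are made-up names, the whitespace tokenization stands in for my custom tokenizer, and the one-character gap between values is an assumption. The idea is that a consumer of a multi-valued field shifts each value's token offsets by the final offsets reported for the earlier values, so a final offset that stops at the last token (instead of the end of the input) moves every later highlight.

```java
import java.util.ArrayList;
import java.util.List;

public class FinalOffsetModel {
    /** A token with offsets relative to the start of its own field value. */
    static final class Token {
        final String term;
        final int start;
        final int end;
        Token(String term, int start, int end) {
            this.term = term;
            this.start = start;
            this.end = end;
        }
    }

    /** Whitespace tokenization of a single value (stand-in for a real tokenizer). */
    static List<Token> tokenize(String value) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < value.length()) {
            while (i < value.length() && value.charAt(i) == ' ') i++;
            int start = i;
            while (i < value.length() && value.charAt(i) != ' ') i++;
            if (i > start) tokens.add(new Token(value.substring(start, i), start, i));
        }
        return tokens;
    }

    /**
     * Highlights `query` in the values joined by a single space.
     * correctEnd = true : the modeled end() reports the full input length.
     * correctEnd = false: the modeled end() leaves the offset at the last token's end,
     *                     so trailing characters of the value are not counted.
     */
    static String highlight(String[] values, String query, boolean correctEnd) {
        String joined = String.join(" ", values);
        StringBuilder out = new StringBuilder();
        int base = 0;    // running offset of the current value inside `joined`
        int copied = 0;  // how much of `joined` has been written to `out`
        for (String value : values) {
            List<Token> tokens = tokenize(value);
            for (Token t : tokens) {
                if (t.term.equals(query)) {
                    int s = base + t.start;
                    int e = base + t.end;
                    out.append(joined, copied, s)
                       .append("<em>").append(joined, s, e).append("</em>");
                    copied = e;
                }
            }
            int lastTokenEnd = tokens.isEmpty() ? 0 : tokens.get(tokens.size() - 1).end;
            int finalOffset = correctEnd ? value.length() : lastTokenEnd;
            base += finalOffset + 1; // assumed gap of one character between values
        }
        out.append(joined, copied, joined.length());
        return out.toString();
    }

    public static void main(String[] args) {
        // The first value ends with two trailing spaces that no token covers.
        String[] values = {"best pasta  ", "pizza place"};
        System.out.println(highlight(values, "pizza", true));
        System.out.println(highlight(values, "pizza", false));
    }
}
```

In this model the first call wraps the whole term, while the second wraps a window shifted two characters to the left, which cuts the term off in much the same way as the highlights I was seeing. Whether my tokenizer has this exact problem is still something I need to verify.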
Appreciate the pointers!

Koji Sekiguchi-2 wrote:
>
> uwdanny wrote:
>> in this "error" case, the origin query "q=pizza"
>>
>> <field name="TEST_KEYWORDS" type="text_keep_stopwords" multiValued="true"
>> indexed="true" stored="true" termVectors="false" omitNorms="true"/>
>>
>> <fieldType name="text_keep_stopwords" class="solr.TextField"
>> positionIncrementGap="100">
>> <analyzer type="index">
>> <tokenizer
>> class="org.apache.lucene.analysis.PhraseTokenFactory"
>> phraseSynonyms="phrase_synonyms.txt" includeSubphrases="true"/>
>> <filter
>> class="org.apache.lucene.analysis.ApostropheTokenFactory"/>
>> <filter class="solr.SynonymFilterFactory"
>> synonyms="headings_synonyms.txt" ignoreCase="true" expand="true"
>> tokenizerFactory="org.apache.lucene.analysis.PhraseTokenFactory"/>
>> <filter class="solr.SynonymFilterFactory"
>> synonyms="listing_name_synonyms.txt" ignoreCase="true" expand="true"
>> tokenizerFactory="org.apache.lucene.analysis.PhraseTokenFactory"/>
>> <filter class="solr.SynonymFilterFactory"
>> synonyms="space_variants.txt" ignoreCase="true" expand="true"
>> tokenizerFactory="org.apache.lucene.analysis.PhraseTokenFactory"/>
>> <filter class="solr.WordDelimiterFilterFactory"
>> generateWordParts="0" generateNumberParts="1" catenateWords="0"
>> catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"
>> preserveOriginal="1"/>
>> <filter class="solr.LowerCaseFilterFactory"/>
>> <filter
>> class="org.apache.lucene.analysis.KStemFilterFactory" cacheSize="20000"/>
>> </analyzer>
>> <analyzer type="query">
>> <tokenizer
>> class="org.apache.lucene.analysis.PhraseTokenFactory"/>
>> <filter class="solr.WordDelimiterFilterFactory"
>> generateWordParts="0" generateNumberParts="1" catenateWords="0"
>> catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"
>> preserveOriginal="1"/>
>> <filter class="solr.LowerCaseFilterFactory"/>
>> <filter
>> class="org.apache.lucene.analysis.KStemFilterFactory" cacheSize="20000"/>
>> </analyzer>
>> </fieldType>
>>
>> thanks
>>
>> -----
>> the best is yet to come~
>>
> What is PhraseTokenFactory in the above?
> If the Tokenizer's end() method doesn't work correctly,
> you may get the trouble you were facing.
>
> Also consult:
> https://issues.apache.org/jira/browse/LUCENE-2207
>
> Koji
>
> --
> http://www.rondhuit.com/en/
>

-----
the best is yet to come~
--
View this message in context: http://old.nabble.com/highlight-multi-valued-field-returns-weird-cut-off-highlighted-terms-tp27785795p27797310.html
Sent from the Solr - User mailing list archive at Nabble.com.