Is this a bug? Wildcard with PatternReplaceFilterFactory
Attempting to normalize left and right single and double quotes for searches
‘ Left single quotation mark ' Single quote
’ Right single quotation mark ' Single quote
“ Left double quotation mark " Double quotes
” Right double quotation mark " Double quotes
<fieldType name="text_general" class="solr.TextField"
positionIncrementGap="100" multiValued="true">
<analyzer type="index">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" />
<filter class="solr.WordDelimiterGraphFilterFactory"
preserveOriginal="1" catenateWords="1"/>
<filter class="solr.FlattenGraphFilterFactory"/> <!-- required
on index analyzers after graph filters -->
<filter class="solr.PatternReplaceFilterFactory" pattern="‘"
replacement="'"/>
<filter class="solr.PatternReplaceFilterFactory" pattern="’"
replacement="'"/>
<filter class="solr.PatternReplaceFilterFactory" pattern="“"
replacement="""/>
<filter class="solr.PatternReplaceFilterFactory" pattern="”"
replacement="""/>
<filter class="solr.LowerCaseFilterFactory"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.WordDelimiterGraphFilterFactory"
preserveOriginal="1" catenateWords="1"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" />
<filter class="solr.SynonymGraphFilterFactory"
synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
<filter class="solr.PatternReplaceFilterFactory" pattern="‘"
replacement="'"/>
<filter class="solr.PatternReplaceFilterFactory" pattern="’"
replacement="'"/>
<filter class="solr.PatternReplaceFilterFactory" pattern="“"
replacement="""/>
<filter class="solr.PatternReplaceFilterFactory" pattern="”"
replacement="""/>
<filter class="solr.LowerCaseFilterFactory"/>
</analyzer>
</fieldType>
The wildcard seems to NOT utilize the PatternReplaceFilterFactory
Rod’s finds fields Rod's and Rod’s that are now in the index as rod's
but *Rod’s* finds nothing because the index now only contains rod's