Hi Solr-Team,

The manual of charfilters says that one can chain them: (from
https://lucene.apache.org/solr/guide/6_6/charfilterfactories.html#CharFilterFactories-solr.MappingCharFilterFactory
):

CharFilters can be chained like Token Filters and placed in front of a
Tokenizer. CharFilters can add, change, or remove characters while
preserving the original character offsets to support features like
highlighting.

I am trying to filter out some of the chars from some fields, so I can do
an efficient and effective faceting later. I tried to chaing charfilters
for that purpose:

<fieldType name="fcomp_type" class="solr.TextField"
positionIncrementGap="100">
<analyzer>
<!-- taking the filename: from the path-->
<charFilter class="solr.PatternReplaceCharFilterFactory"
pattern="(.*[/\\])([^/\\]+)$"   replacement="$2"/>
<charFilter class="solr.PatternReplaceCharFilterFactory"
pattern="([0-9\-]+)T([0-9\-]+)" replacement="$1 $2"/>
<charFilter class="solr.PatternReplaceCharFilterFactory"
pattern="[^a-zA-Z]+"            replacement=" "/>

<tokenizer  class="solr.StandardTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
</analyzer>
</fieldType>
<field name="fcomp"                   type="fcomp_type"   indexed="true"
stored="true"/>

but in schema definition I see only the last charfilter
[image: image.png]

Any clues why?

Cheers,
Arturas

Reply via email to