> Is there any ready-for-use filter which performs acronyms normalization
> such as "I.N.C."->"INC"?
>
> I see that Lucene's StandardFilter can do this but we can't use it as
> we're using WhitespaceTokenizer instead of StandardTokenizer.

I am bad at regular expressions, but if you can write a regex for that
replacement, solr.PatternReplaceFilterFactory can apply it to each token:
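<filter class="solr.PatternReplaceFilterFactory" pattern="\."
        replacement="" replace="all" />
This removes every period in a token, so "I.N.C." becomes "INC". Note that
it also touches tokens such as "example.com", so tighten the pattern if
that matters for your data.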
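
To try a candidate regex before putting it into the schema, a small standalone check with java.util.regex may help. This is only a sketch: the class and method names are made up for illustration, the patterns are assumptions rather than anything Solr ships, and it does not run the actual Solr filter chain. It compares a plain period pattern with a negated class like [^a-z], which also deletes uppercase letters and so only suits tokens that were lowercased earlier in the chain.

```java
import java.util.regex.Pattern;

// Hypothetical helper class for trying out patterns; not part of Solr or Lucene.
public class AcronymPatternSketch {
    // Assumed pattern: a literal period.
    private static final Pattern DOTS = Pattern.compile("\\.");
    // Negated class: anything that is not a lowercase ASCII letter.
    private static final Pattern NOT_LOWER = Pattern.compile("[^a-z]");
    // Assumed acronym shape: a letter followed by a period, at least twice.
    private static final Pattern ACRONYM = Pattern.compile("(?:\\p{L}\\.){2,}");

    // Remove every period from the token.
    static String stripDots(String token) {
        return DOTS.matcher(token).replaceAll("");
    }

    // Remove everything except lowercase ASCII letters.
    static String stripNonLower(String token) {
        return NOT_LOWER.matcher(token).replaceAll("");
    }

    // Remove periods only when the whole token looks like an acronym.
    static String normalizeAcronym(String token) {
        return ACRONYM.matcher(token).matches() ? stripDots(token) : token;
    }

    public static void main(String[] args) {
        String[] tokens = {"I.N.C.", "i.n.c.", "example.com"};
        for (String t : tokens) {
            System.out.println(t + " | dots: " + stripDots(t)
                    + " | non-lower: " + stripNonLower(t)
                    + " | acronym-only: " + normalizeAcronym(t));
        }
    }
}
```

With the negated class, "I.N.C." comes out empty while "i.n.c." becomes "inc", so that variant would need something like a LowerCaseFilterFactory before it in the analyzer.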
