I'm feeling I found an issue in Solr Reference Guide for Simplified Regular Expression Pattern [Splitting ]Tokenizer (https://lucene.apache.org/ solr/guide/7_3/tokenizers.html#simplified-regular- expression-pattern-splitting-tokenizer).
Given example is <analyzer> <tokenizer class="solr.SimplePatternSplitTokenizerFactory" pattern="[ \t\r\n]+"/></analyzer> but Lucene's RegExp constructor consumes raw unicode characters instead of \t\r\n form, so correct configuration is <tokenizer class="solr.SimplePatternSplitTokenizerFactory" pattern="[ 	& #xA;
]+"/> -- Nikolay Khitrin