Hello Solr users,

I have a need to write a tokenizer for source code files in Solr, but don't
have the option of including custom JARs. So for ex:

Input: foo.bar

Tokens: 'foo', '.', 'bar'

How can I have a custom tokenizer or filter in schema.xml that can split on
some characters, but also not drop the character ?

I tried Regex pattern tokenizer but that drops the delimiters.
Thanks in advance!

Reply via email to