This is easy if I only reqdefine a custom field to identify the desired
patterns (numbers, in my case)
For example, I could define a field thus:
<!-- A text field that identifies numberical entities-->
<fieldType name="text_num" class="solr.TextField" >
<analyzer>
<tokenizer class="solr.PatternTokenizerFactory"
pattern="\s*[0-9][0-9-]*[0-9]?\s*" group="0"/>
</analyzer>
</fieldType>
Input:
hello, world bye 123-45 abcd 5555 sdfssdf --- aaa
Output:
123-45 , 5555
However, I also want to retain the behavio