StandardTokenizer and domain names containing digits

2012-04-19 Thread Alex Willmer
This is contrary to e.g. "ns.logica.define.com" which is treated as a single token. Is there a way I can make Solr treat both queries the same way? Many thanks, Alex -- Alex Willmer | Developer 2 Trinity Park, Birmingham, B37 7ES | United Kingdom M: +44 7557 752744 al.will...@logica.c

Re: StandardTokenizer and domain names containing digits

2012-04-23 Thread Alex Willmer
Steven A Rowe syr.edu> writes: > StandardTokenizer in Lucene/Solr v3.1+ implements the Word Boundary rules > from Unicode 6.0.0 Standard > Annex #29, a.k.a. UAX#29: . > These rules don't include recognition of URLs or domain nam