Re: PatternTokenizer failure

2011-11-29 Thread Michael Kuhlmann
On 29.11.2011 15:20, Erick Erickson wrote: Hmmm, I tried this in straight Java, no Solr/Lucene involved and the behavior I'm seeing is that no example works if it has more than one whitespace character after the hyphen, including your failure example. I haven't lived inside regexes for long en

Re: PatternTokenizer failure

2011-11-29 Thread Erick Erickson
Hmmm, I tried this in straight Java, no Solr/Lucene involved and the behavior I'm seeing is that no example works if it has more than one whitespace character after the hyphen, including your failure example. I haven't lived inside regexes for long enough that I don't know what the right regex sho

PatternTokenizer failure

2011-11-28 Thread Jay Luker
Hi all, I'm trying to use PatternTokenizer and not getting expected results. Not sure where the failure lies. What I'm trying to do is split my input on whitespace except in cases where the whitespace is preceded by a hyphen character. So to do this I'm using a negative lookbehind assertion in th
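
For readers following the thread, here is a minimal plain-Java sketch of the behavior under discussion, with no Solr/Lucene involved. The exact pattern from the original post is cut off above, so the regex `(?<!-)\s+` is an assumption about what "whitespace not preceded by a hyphen" looks like. The class name `HyphenAwareSplit` and the second pattern are hypothetical, shown only as one possible adjustment and not as the resolution reached in the thread.

```java
import java.util.Arrays;
import java.util.regex.Pattern;

class HyphenAwareSplit {
    // Assumed form of the pattern described in the post:
    // a run of whitespace whose first character is not preceded by a hyphen.
    static final Pattern NAIVE = Pattern.compile("(?<!-)\\s+");

    // Hypothetical adjustment: also refuse to start a match right after
    // another whitespace character, so a run of whitespace following a
    // hyphen cannot be matched from its second character onward.
    static final Pattern STRICT = Pattern.compile("(?<![-\\s])\\s+");

    static String[] splitNaive(String input) {
        return NAIVE.split(input);
    }

    static String[] splitStrict(String input) {
        return STRICT.split(input);
    }

    public static void main(String[] args) {
        // One space after the hyphen: the lookbehind blocks the split there.
        System.out.println(Arrays.toString(splitNaive("foo- bar baz")));
        // prints [foo- bar, baz]

        // Two spaces after the hyphen: the second space is preceded by a
        // space, not a hyphen, so the naive pattern matches there and splits.
        System.out.println(Arrays.toString(splitNaive("foo-  bar")));
        // prints [foo- , bar]

        // The stricter lookbehind keeps the whole run attached to the hyphen.
        System.out.println(Arrays.toString(splitStrict("foo-  bar baz")));
        // prints [foo-  bar, baz]
    }
}
```

This reproduces the observation quoted above that examples with more than one whitespace character after the hyphen fail: the lookbehind only inspects the single character before the match start, so the regex engine simply begins the match one character later. Whether the stricter pattern is what the original poster needs depends on details of the input that the truncated message does not show.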