RE: Split token

2011-04-15 Thread Steven A Rowe
he.org
> Subject: Re: Split token
>
> What you've shown would be handled with WhitespaceTokenizer, but you'd have to
> prevent filters from stripping the parens. If you have to handle things like
> blah ( stuff )
> WhitespaceTokenizer wouldn't work

Re: Split token

2011-04-15 Thread Erick Erickson
What you've shown would be handled with WhitespaceTokenizer, but you'd have to prevent filters from stripping the parens. If you have to handle things like blah ( stuff ) WhitespaceTokenizer wouldn't work. PatternTokenizerFactory might work for you, see: http://lucene.apache.org/solr/api/org/apach
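
To make the difference concrete, here is a rough sketch in plain Java (java.util.regex only, not the Lucene/Solr API) of why splitting on whitespace breaks up "blah ( stuff )" while a pattern-based approach can keep the parenthesized part together. The class name, method names, and the regex are assumptions made up for illustration; the thread does not say which tokens are actually wanted, so the pattern would need adjusting to the real requirement.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SplitTokenSketch {
    // Hypothetical pattern: either a parenthesized group (no nesting),
    // or otherwise a run of non-whitespace characters.
    private static final Pattern TOKEN = Pattern.compile("\\([^)]*\\)|\\S+");

    // Imitates whitespace tokenization: split on runs of whitespace only.
    static List<String> whitespaceTokens(String text) {
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return new ArrayList<>();
        }
        return Arrays.asList(trimmed.split("\\s+"));
    }

    // Imitates pattern-based tokenization: every regex match becomes a token.
    static List<String> patternTokens(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        while (m.find()) {
            tokens.add(m.group());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Whitespace splitting separates the parens from their content.
        System.out.println(whitespaceTokens("blah ( stuff )")); // [blah, (, stuff, )]
        // The pattern keeps the parenthesized part as one token.
        System.out.println(patternTokens("blah ( stuff )"));    // [blah, ( stuff )]
    }
}
```

With PatternTokenizerFactory, a regex along these lines would go in the tokenizer's pattern attribute in the schema, presumably with the group setting chosen so that matches (rather than the text between matches) become the tokens. Either way, any filter later in the chain that strips punctuation would still have to be kept from removing the parens.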