You are seeing the effects of the default QueryParser.
whitespace (like '+','-','"','*', etc...) is a "special character" to the
Lucene QueryParser. Un-Escaped/Quoted qhitespace tells the query parser
to construct a BooleanQuery containing multiple clauses -- each clause is
analyzed seperatel
I'm attempting to make use of PatternReplaceCharFilterFactory, but am running
into issues on both 1.4.1 ( I ported it) and on nightly (4.0-2010-07-27). It
seems that on a real query the charFilter isn't executed prior to the
tokenizer.
I modified the example configuration included in the dis