On 6/8/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
> I am trying to use the solr.HTMLStripWhitespaceTokenizerFactory analyzer with no luck.
> [...]
> Is this normal? Shouldn't the html code and the white spaces be removed from the field?
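For indexing purposes, yes: the HTML and extra whitespace are stripped from the tokens that go into the index. The stored field you get back in search results will be unchanged, though. If you want to see what will actually be indexed, try the analysis debugger in the admin pages.

-Yonik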
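
To illustrate the indexed-versus-stored distinction described above, here is a minimal toy sketch in Python. It is not Solr code: the names `analyze` and `ToyIndex` are hypothetical, and the HTML stripping uses Python's standard `html.parser` rather than Solr's tokenizer. It only shows that analysis affects the terms you can search on, while the stored value is returned verbatim.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes and drops the tags around them."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def analyze(raw):
    """Toy stand-in for an HTML-stripping whitespace tokenizer (assumption,
    not Solr's implementation): remove tags, then split on whitespace."""
    extractor = _TextExtractor()
    extractor.feed(raw)
    extractor.close()
    return " ".join(extractor.chunks).split()


class ToyIndex:
    """Keeps the raw value (stored) separately from the analyzed terms (indexed)."""

    def __init__(self):
        self.stored = {}    # doc id -> original, unmodified field value
        self.postings = {}  # term -> set of doc ids containing that term

    def add(self, doc_id, raw):
        self.stored[doc_id] = raw
        for token in analyze(raw):
            self.postings.setdefault(token, set()).add(doc_id)

    def search(self, term):
        # Matching uses the indexed terms; results return the stored value.
        return [self.stored[d] for d in sorted(self.postings.get(term, ()))]


index = ToyIndex()
index.add(1, "<p>Hello   <b>Solr</b> world</p>")
print(analyze("<p>Hello   <b>Solr</b> world</p>"))
print(index.search("Solr"))
```

In this sketch a search for `Solr` matches the document because the tags were stripped at index time, yet the value returned is still the original markup, which mirrors the behaviour reported in the question.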