Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
Hello All, I'm trying to find the best way to set up a fieldType that strips punctuation. I think the right way to do this is using a CharacterFilter of some type, but I can't seem to find any examples of how to set this up in a schema.xml file. Can anyone point me in the right direction? Right

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
I'm hesitant to change Tokenizers at the moment because what we have is working so nicely - or so I thought. What I'm looking for is case-insensitive search for words and numbers without any of the stemming features turned on. The new requirement is that we take punctuation out of the mix. Right

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
ming built in. And what do you mean by "language specific functionality"? They do NOT fold accents for instance if that's what you're getting at. Could you explain a bit about *why* you want this behavior? HTH Erick On Fri, Jan 15, 2010 at 1:17 PM, David Seltzer wrote: >

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
tly I was trying to ensure that you weren't flying blind. Perhaps you would find some joy with PatternReplaceCharFilterFactory, replacing all non-alphanum with an empty string? HTH Erick On Fri, Jan 15, 2010 at 2:07 PM, David Seltzer wrote: > Hi Erik, > > Thanks for your thoughtfu
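
For anyone landing on this thread looking for a schema.xml example, here is a minimal sketch of what Erick's PatternReplaceCharFilterFactory suggestion might look like. It is not taken from the thread: the fieldType name text_nopunct, the choice of WhitespaceTokenizerFactory, and the exact regex are all assumptions made for illustration. The pattern deliberately keeps whitespace so the tokenizer still has something to split on, which is a small departure from "all non-alphanum". The lowercase filter covers the case-insensitive requirement, and no stemming filter is included.

```xml
<!-- Hypothetical fieldType; the name and tokenizer choice are assumptions, not from the thread. -->
<fieldType name="text_nopunct" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Char filters run before the tokenizer: delete every character that is
         not an ASCII letter, an ASCII digit, or whitespace. -->
    <charFilter class="solr.PatternReplaceCharFilterFactory"
                pattern="[^a-zA-Z0-9\s]"
                replacement=""/>
    <!-- Split on whitespace only, so the cleaned text is not altered further. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Lowercase tokens for case-insensitive matching; no stemmer is configured. -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Because a single analyzer element applies at both index and query time, the same stripping happens on both sides. Note that this ASCII-only pattern would also remove accented letters; a Unicode-aware character class would be needed if those must survive. Existing documents would need to be reindexed after changing the fieldType.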