Re: Excluding large tokens from indexing

2010-07-29 Thread Paul Dlug
Thanks, that's exactly what I was looking for, not sure how I missed it. On Thu, Jul 29, 2010 at 11:28 AM, Chantal Ackermann wrote: > This is probably what you want? > > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.LengthFilterFactory > > > > On Thu, 2010-07-29 at 15:44 +0200,

Re: Excluding large tokens from indexing

2010-07-29 Thread Chantal Ackermann
This is probably what you want? http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.LengthFilterFactory On Thu, 2010-07-29 at 15:44 +0200, Paul Dlug wrote: > Is there a filter available that will remove large tokens from the > token stream? Ideally something configurable to a chara

Excluding large tokens from indexing

2010-07-29 Thread Paul Dlug
Is there a filter available that will remove large tokens from the token stream? Ideally something configurable to a character limit? I have a noisy data set that has some large tokens (in this case more than 50 characters) that I'd like to just strip. They're unlikely to ever match a user query an