Re: Stripping Punctuation in a fieldType

2010-01-16 Thread Chris Hostetter
: Subject: Stripping Punctuation in a fieldType : In-Reply-To: <27179780.p...@talk.nabble.com> : References: : : <27178423.p...@talk.nabble.com> <27179780.p...@talk.nabble.com> http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists W

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Lance Norskog
If your user query syntax has a small number of features, you could write your own query parser. On Fri, Jan 15, 2010 at 4:18 PM, Koji Sekiguchi wrote: > David, > > PatternReplaceCharFilterFactory accepts pattern and replacement args. > Please read PatternReplaceCharFilter javadoc to see few samp

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Koji Sekiguchi
David, PatternReplaceCharFilterFactory accepts pattern and replacement args. Please read PatternReplaceCharFilter javadoc to see few samples: http://lucene.apache.org/solr/api/org/apache/solr/analysis/PatternReplaceCharFilter.html Koji -- http://www.rondhuit.com/en/ David Seltzer wrote: Do

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
ommas for example using this technique? Thanks! -Dave -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Friday, January 15, 2010 2:32 PM To: solr-user@lucene.apache.org Subject: Re: Stripping Punctuation in a fieldType Ah, ok, your approach makes sense. Mos

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Robert Muir
t >> languages? (My assumption regarding the WhiteSpaceTokenizer is that it >> would be very language/direction neutral) >> >> > Could you explain a bit about *why* you want this behavior? >> In short we have to support multiple languages and match the behavior of >&

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Erick Erickson
r? > In short we have to support multiple languages and match the behavior of > an existing non-solr system. > > -Dave > > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Friday, January 15, 2010 1:42 PM > To: solr-user@lucene.apach

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Robert Muir
t;> >> Right now when I search for "Obama" I'm not getting any hits on > "Obama." >> >> So I'm basically looking to strip punctuation. The consequence would > be >> that "nation's", "nations" and "nations,&quo

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Friday, January 15, 2010 1:42 PM To: solr-user@lucene.apache.org Subject: Re: Stripping Punctuation in a fieldType If you haven't seen it, this page is invaluable for this kind of question: http://wiki.apache.org/s

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Erick Erickson
ing with stemming? > > Thanks for everyone's input! > > -Dave > > > > -Original Message- > From: Ahmet Arslan [mailto:iori...@yahoo.com] > Sent: Friday, January 15, 2010 12:42 PM > To: solr-user@lucene.apache.org > Subject: Re: Stripping Punctuation in

RE: Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
ay. Would the StandardTokenizerFactory accomplish this? Does it have any language specific functionality? Does it do anything with stemming? Thanks for everyone's input! -Dave -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Friday, January 15, 2010 12:42 PM T

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Ahmet Arslan
> I'm trying to find the best way to set up a fieldType that > strips punctuation. Use solr.StandardTokenizerFactory that strips punctuations. Or if you do not care about alphanumeric or numeric queries use solr.LowerCaseTokenizerFactory that uses LetterTokenizer. I think the right way to do

Re: Stripping Punctuation in a fieldType

2010-01-15 Thread Otis Gospodnetic
gt; To: solr-user@lucene.apache.org > Sent: Fri, January 15, 2010 12:31:41 PM > Subject: Stripping Punctuation in a fieldType > > Hello All, > > I'm trying to find the best way to set up a fieldType that strips > punctuation. I think the right way to do this is using a Chara

Stripping Punctuation in a fieldType

2010-01-15 Thread David Seltzer
Hello All, I'm trying to find the best way to set up a fieldType that strips punctuation. I think the right way to do this is using a CharacterFilter of some type, but I can't seem to find any examples of how to set this up in a schema.xml file. Can anyone point me in the right direction? Right