Re: Best way to match umlauts

2013-06-16 Thread Lance Norskog
One small thing: German u-umlaut is often "flattened" as 'ue' instead of 'u'. And the same with o-umlaut, it can be 'oe' or 'o'. I don't know if Lucene has a good solution for this problem. On 06/16/2013 06:44 AM, adityab wrote: Thanks for the explanation Steve. I now see it clearly. In my cas

Re: Best way to match umlauts

2013-06-16 Thread adityab
Thanks for the explanation Steve. I now see it clearly. In my case it should work. -- View this message in context: http://lucene.472066.n3.nabble.com/Best-way-to-match-umlauts-tp4070256p4070805.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Best way to match umlauts

2013-06-13 Thread Steve Rowe
Aditya, Char filters are applied prior to tokenization, so they can affect tokenization, but I can't think of any tokenization changes that accent stripping would cause. Token filters can be re-ordered to achieve certain objectives. For example, if you want to use a stemmer that only recogniz

Re: Best way to match umlauts

2013-06-13 Thread Jack Krupansky
Token filter character filter is a key difference. -- Jack Krupansky -Original Message- From: adityab Sent: Thursday, June 13, 2013 8:17 PM To: solr-user@lucene.apache.org Subject: Re: Best way to match umlauts this might be a dumb question. But can you please point me some key

Re: Best way to match umlauts

2013-06-13 Thread adityab
this might be a dumb question. But can you please point me some key difference between ASCIIFolding Filter and Character Filter using a map File. thanks Aditya -- View this message in context: http://lucene.472066.n3.nabble.com/Best-way-to-match-umlauts-tp4070256p4070398.html Sent from the

Re: Best way to match umlauts

2013-06-13 Thread Steve Rowe
On Jun 13, 2013, at 3:48 PM, Jack Krupansky wrote: > mapping="mapping-FoldToASCII"/> The mapping attribute above is missing the .txt file extension: Steve

Re: Best way to match umlauts

2013-06-13 Thread Jack Krupansky
age- From: adityab Sent: Thursday, June 13, 2013 2:18 PM To: solr-user@lucene.apache.org Subject: Re: Best way to match umlauts Just to confirm even "solr.ASCIIFoldingFilterFactory" should solve the purpose. am i correct ? -- View this message in context: http://lucene.47206

Re: Best way to match umlauts

2013-06-13 Thread adityab
Just to confirm even "solr.ASCIIFoldingFilterFactory" should solve the purpose. am i correct ? -- View this message in context: http://lucene.472066.n3.nabble.com/Best-way-to-match-umlauts-tp4070256p4070317.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Best way to match umlauts

2013-06-13 Thread jimtronic
Thanks! Sorry for the basic question, but I was having trouble finding the results through google. On Thu, Jun 13, 2013 at 10:39 AM, Jack Krupansky-2 [via Lucene] < ml-node+s472066n4070262...@n3.nabble.com> wrote: > mapping="mapping-ISOLatin1Accent.txt"/> > > -- Jack Krupansky > > -Original

Re: Best way to match umlauts

2013-06-13 Thread Jack Krupansky
mapping="mapping-ISOLatin1Accent.txt"/> -- Jack Krupansky -Original Message- From: jimtronic Sent: Thursday, June 13, 2013 11:31 AM To: solr-user@lucene.apache.org Subject: Best way to match umlauts I'm trying to make BrĂ¼no come up in my results when the user types in "Bruno". What's