One small thing: German u-umlaut is often "flattened" as 'ue' instead of
'u'. And the same with o-umlaut, it can be 'oe' or 'o'. I don't know if
Lucene has a good solution for this problem.
On 06/16/2013 06:44 AM, adityab wrote:
Thanks for the explanation Steve. I now see it clearly. In my cas
Thanks for the explanation Steve. I now see it clearly. In my case it should
work.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Best-way-to-match-umlauts-tp4070256p4070805.html
Sent from the Solr - User mailing list archive at Nabble.com.
Aditya,
Char filters are applied prior to tokenization, so they can affect
tokenization, but I can't think of any tokenization changes that accent
stripping would cause.
Token filters can be re-ordered to achieve certain objectives. For example, if
you want to use a stemmer that only recogniz
Token filter character filter is a key difference.
-- Jack Krupansky
-Original Message-
From: adityab
Sent: Thursday, June 13, 2013 8:17 PM
To: solr-user@lucene.apache.org
Subject: Re: Best way to match umlauts
this might be a dumb question. But can you please point me some key
this might be a dumb question. But can you please point me some key
difference between ASCIIFolding Filter and Character Filter using a map
File.
thanks
Aditya
--
View this message in context:
http://lucene.472066.n3.nabble.com/Best-way-to-match-umlauts-tp4070256p4070398.html
Sent from the
On Jun 13, 2013, at 3:48 PM, Jack Krupansky wrote:
> mapping="mapping-FoldToASCII"/>
The mapping attribute above is missing the .txt file extension:
Steve
age-
From: adityab
Sent: Thursday, June 13, 2013 2:18 PM
To: solr-user@lucene.apache.org
Subject: Re: Best way to match umlauts
Just to confirm even "solr.ASCIIFoldingFilterFactory" should solve the
purpose.
am i correct ?
--
View this message in context:
http://lucene.47206
Just to confirm even "solr.ASCIIFoldingFilterFactory" should solve the
purpose.
am i correct ?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Best-way-to-match-umlauts-tp4070256p4070317.html
Sent from the Solr - User mailing list archive at Nabble.com.
Thanks! Sorry for the basic question, but I was having trouble finding the
results through google.
On Thu, Jun 13, 2013 at 10:39 AM, Jack Krupansky-2 [via Lucene] <
ml-node+s472066n4070262...@n3.nabble.com> wrote:
> mapping="mapping-ISOLatin1Accent.txt"/>
>
> -- Jack Krupansky
>
> -Original
mapping="mapping-ISOLatin1Accent.txt"/>
-- Jack Krupansky
-Original Message-
From: jimtronic
Sent: Thursday, June 13, 2013 11:31 AM
To: solr-user@lucene.apache.org
Subject: Best way to match umlauts
I'm trying to make BrĂ¼no come up in my results when the user types in
"Bruno".
What's
10 matches
Mail list logo