please see http://wiki.apache.org/solr/UnicodeCollation

In general the idea is similar to how this is handled in databases,
you can index collation keys into a sort field at analysis time, then
you just do a standard solr sort.

However, I am not sure if your JRE provides a "haw" Locale for the
Hawaiian language.

Because of this, its probably better to use the ICU collation
integration 
(http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ICUCollationKeyFilterFactory),
because ICU definitely supports this locale and has collation rules
for it.

On Fri, Apr 22, 2011 at 12:33 PM, Ben Preece <preec...@umn.edu> wrote:
> As someone who's new to Solr/Lucene, I'm having trouble finding information
> on sorting results in localized alphabetical order. I've ineffectively
> searched the wiki and the mail archives.
>
> I'm thinking for example about Hawai'ian, where mīka (with an i-macron)
> comes after mika (i without the macron) but before miki (also without the
> macron), or about Welsh, where the digraphs (ch, dd, etc.) are treated as
> single letters, or about Ojibwe, where the apostrophe ' is a letter which
> sorts between h and i.
>
> How do non-English languages typically handle this?
>
> -Ben
>

Reply via email to