On Fri, Apr 22, 2011 at 3:09 PM, Bently Preece <preec...@umn.edu> wrote:
> What if there is no standard localization already?  The case I'm
> specifically interested in is Ojibwe.
>

this is standard? to sort a field with a specific locale, you have to
tell it the locale you want. if you use the ICU implementation you get
support for more locales, its just that simple. The JRE has less
available locales because its internationalization and localization
support lags behind ICU.

On the other hand ICU keeps current with both the unicode standard and
locale data in CLDR (http://unicode.org/cldr), which is why it
supports more.

I noticed there is no locale for your language in CLDR, not even under
development it appears (http://unicode.org/cldr/apps/survey).

So if your language (Ojibwe) has special sort rules, I recommend
making the collation rules and using a custom collator as specified
here: 
http://wiki.apache.org/solr/UnicodeCollation#Sorting_text_with_custom_rules

for your "base collator" you just need to use "new Locale()" and your
rules will be a delta from that.

Separately, if these sort rules are well-defined/standardized for this
language, and you get them working, you might want to then consider
contributing them to CLDR.

Reply via email to