On Fri, Apr 22, 2011 at 3:09 PM, Bently Preece <preec...@umn.edu> wrote: > What if there is no standard localization already? The case I'm > specifically interested in is Ojibwe. >
this is standard? to sort a field with a specific locale, you have to tell it the locale you want. if you use the ICU implementation you get support for more locales, its just that simple. The JRE has less available locales because its internationalization and localization support lags behind ICU. On the other hand ICU keeps current with both the unicode standard and locale data in CLDR (http://unicode.org/cldr), which is why it supports more. I noticed there is no locale for your language in CLDR, not even under development it appears (http://unicode.org/cldr/apps/survey). So if your language (Ojibwe) has special sort rules, I recommend making the collation rules and using a custom collator as specified here: http://wiki.apache.org/solr/UnicodeCollation#Sorting_text_with_custom_rules for your "base collator" you just need to use "new Locale()" and your rules will be a delta from that. Separately, if these sort rules are well-defined/standardized for this language, and you get them working, you might want to then consider contributing them to CLDR.