> How should that work? The first half of the surrogate pair does not
> carry enough information to decide that. For instance, take the ranges
> { 0x10A01, 0x10A03 }, { 0x10A05, 0x10A06 }. The information about the low
> 10 bits of the Unicode value is in the second half of the pair. From
> the first half you don't know whether the char is perhaps the 0x10A04
> value or one of the others. So you need both halves to make a decision.
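
To make the quoted point concrete, here is a minimal sketch in C. It is not the actual Cygwin code: the two-entry table is only an excerpt built from the ranges quoted above, and the names combining_excerpt, from_surrogates and in_combining_excerpt are hypothetical, chosen for illustration. It assumes the input really is a valid high/low UTF-16 surrogate pair.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical excerpt: only the two ranges mentioned in the quoted
 * text. The real combining[] array is much longer. */
struct interval {
    uint32_t first;
    uint32_t last;
};

static const struct interval combining_excerpt[] = {
    { 0x10A01, 0x10A03 }, { 0x10A05, 0x10A06 }
};

/* Combine a UTF-16 surrogate pair into one code point.
 * The high surrogate carries the upper 10 bits, the low surrogate
 * the lower 10 bits. Assumes hi is in 0xD800..0xDBFF and lo is in
 * 0xDC00..0xDFFF; no validation is done here. */
static uint32_t from_surrogates(uint16_t hi, uint16_t lo)
{
    return 0x10000u
        + ((((uint32_t)hi - 0xD800u) << 10) | ((uint32_t)lo - 0xDC00u));
}

/* Linear search is enough for a two-entry excerpt. */
static int in_combining_excerpt(uint32_t ucs)
{
    size_t i;
    size_t n = sizeof combining_excerpt / sizeof combining_excerpt[0];

    for (i = 0; i < n; i++) {
        if (ucs >= combining_excerpt[i].first
            && ucs <= combining_excerpt[i].last)
            return 1;
    }
    return 0;
}
```

0x10A03, 0x10A04 and 0x10A05 all share the high surrogate 0xD802 and differ only in the low surrogate (0xDE03, 0xDE04, 0xDE05). So in_combining_excerpt(from_surrogates(0xD802, 0xDE03)) is 1, while the same call with 0xDE04 is 0, and the high surrogate alone cannot tell the two apart.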

You're right. I'd somehow overlooked the end of the combining[] array.

Andy