> How should that work?  The first half of the surrogate pair does not
> carry enough information to decide that.  For instance, take the ranges
> { 0x10A01, 0x10A03 }, { 0x10A05, 0x10A06 }.  The information about the low
> 10 bits of the Unicode value is in the second half of the pair.  From
> the first half you don't know if the char is perhaps the 0x10A04 value
> or one of the others.  So you need both halves to make a decision.

You're right. I'd somehow overlooked the end of the combining[] array.
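
To make the point above concrete, here is a minimal sketch in C. It is not
the actual Cygwin code: the names `struct interval`, `example_ranges`,
`surrogates_to_ucs` and `in_example_ranges` are made up for illustration,
and the table holds only the two ranges quoted above. All of 0x10A01
through 0x10A06 share the same first half (0xD802), so the lookup can only
be done after both halves are combined.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical interval type, for illustration only. */
struct interval { uint32_t first; uint32_t last; };

/* Only the two example ranges from the quoted text. */
static const struct interval example_ranges[] = {
  { 0x10A01, 0x10A03 }, { 0x10A05, 0x10A06 }
};

/* Combine a UTF-16 surrogate pair into a code point.  The high surrogate
   supplies the upper 10 bits, the low surrogate the lower 10 bits.
   Assumes hi is in 0xD800..0xDBFF and lo is in 0xDC00..0xDFFF. */
static uint32_t
surrogates_to_ucs (uint16_t hi, uint16_t lo)
{
  return 0x10000 + (((uint32_t) (hi - 0xD800) << 10)
                    | (uint32_t) (lo - 0xDC00));
}

/* Linear search is enough for two entries; a real table would more
   likely use a binary search. */
static int
in_example_ranges (uint32_t ucs)
{
  size_t i;

  for (i = 0; i < sizeof example_ranges / sizeof example_ranges[0]; i++)
    if (ucs >= example_ranges[i].first && ucs <= example_ranges[i].last)
      return 1;
  return 0;
}
```

With these helpers, the pairs (0xD802, 0xDE03) and (0xD802, 0xDE05) fall
inside the ranges while (0xD802, 0xDE04) does not, even though the first
half is identical in all three cases.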

Andy

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/
