Re: AW: UTF-8 2-byte vs 4-byte encodings

2007-05-02 Thread Gereon Steffens
Hi Chrisitian, > It is not sufficient to set the encoding in the XML but > you need an additional HTTP header to set the encoding ("Content-type: > text/xml; charset=UTF-8") Thanks, that's what I was missing. Gereon

AW: UTF-8 2-byte vs 4-byte encodings

2007-05-02 Thread Burkamp, Christian
Gereon, The four bytes do not look like a valid utf-8 encoded character. 4-byte characters in utf-8 start with the binary sequence "0...". (For reference see the excellent wikipedia article on utf-8 encoding). Your problem looks like someone interpreted your valid 2-byte utf-8 encoded chara