Francois PIETTE wrote: >> The twothird character is not 'encoded' either as "⅔" >> (decimal) or as "⅔" (hex)? If so, IIS sends plain UTF-16! > > Yes, no encoding at all. Just the 3 bytes. So UTF-16.
But 3 bytes looks like UTF-8 ? -- Arno Garrels > > -- > [EMAIL PROTECTED] > http://www.overbyte.be > > > ----- Original Message ----- > From: "Arno Garrels" <[EMAIL PROTECTED]> > To: "ICS support mailing" <[email protected]> > Sent: Thursday, October 09, 2008 5:26 PM > Subject: Re: [twsocket] HTML encoding in HttpSrv func. > TextToHtmlText() > > >> Francois Piette wrote: >>>> Yes, if someone has Apache or a newer IIS installed he could help. >>>> Create a file name with characters not in current ANSI code page by >>>> copy those characters from the Windows application charmap.exe. >>>> Than start a packet sniffer and log a directory listing. >>> >>> Using IIS6 on W2K3. >> >> Thanks! >> >>> The twothird character (U+2154) is sent in the dirlist as 3 >>> characters : 0xE2 0x85 0x94. In the href link, the 3 characters are >>> expressed as %e2%85%94 >> >> That's UTF-8 URL-encoded. >> >>> while they are binary in the text itself. >> >> The twothird character is not 'encoded' either as "⅔" >> (decimal) or as "⅔" (hex)? If so, IIS sends plain UTF-16! >> >>> There is nothing in the html header to tell which code page or >>> charset is used. -- >> >> Browsers seem to be very good in detecting the correct character set >> nowadays. >> >> -- >> Arno Garrels >> -- >> To unsubscribe or change your settings for TWSocket mailing list >> please goto http://lists.elists.org/cgi-bin/mailman/listinfo/twsocket >> Visit our website at http://www.overbyte.be -- To unsubscribe or change your settings for TWSocket mailing list please goto http://lists.elists.org/cgi-bin/mailman/listinfo/twsocket Visit our website at http://www.overbyte.be
