> The twothird character is not 'encoded' either as "&#8532;" (decimal) or
> as "&#x2154;" (hex)? If so, IIS sends plain UTF-16! 

Yes, no encoding at all. Just the 3 raw bytes 0xE2 0x85 0x94. That makes it UTF-8 rather than UTF-16 (U+2154 in UTF-16 would be 2 bytes).
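
For reference, the different forms discussed in this thread can be checked with a few lines of Python. This is only an illustrative sketch and not part of ICS or TextToHtmlText(); the variable names are made up for the example. It shows what U+2154 looks like as UTF-8 bytes, as UTF-16 bytes, percent-encoded as in the href, and as HTML numeric character references.

```python
from urllib.parse import quote

# U+2154 VULGAR FRACTION TWO THIRDS, the character used in the test file name
ch = "\u2154"

# Raw byte sequences in the two encodings being compared
utf8_bytes = ch.encode("utf-8")       # 3 bytes
utf16_bytes = ch.encode("utf-16-le")  # 2 bytes (little-endian, no BOM)

# Percent-encoded form as used in a URL; quote() encodes as UTF-8 by default
# and emits uppercase hex digits (IIS used lowercase, which is equivalent)
url_form = quote(ch)

# HTML numeric character references, decimal and hexadecimal
decimal_ref = f"&#{ord(ch)};"
hex_ref = f"&#x{ord(ch):X};"

print("UTF-8 :", " ".join(f"0x{b:02X}" for b in utf8_bytes))
print("UTF-16:", " ".join(f"0x{b:02X}" for b in utf16_bytes))
print("URL   :", url_form)
print("HTML  :", decimal_ref, hex_ref)

# The 3 bytes seen in the sniffer log decode back to the same character
assert bytes([0xE2, 0x85, 0x94]).decode("utf-8") == ch
```

The 3-byte sequence from the sniffer log matches the UTF-8 line, and the href form is the same 3 bytes percent-encoded.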

-- 
[EMAIL PROTECTED]
http://www.overbyte.be


----- Original Message ----- 
From: "Arno Garrels" <[EMAIL PROTECTED]>
To: "ICS support mailing" <[email protected]>
Sent: Thursday, October 09, 2008 5:26 PM
Subject: Re: [twsocket] HTML encoding in HttpSrv func. TextToHtmlText()


> Francois Piette wrote:
>>> Yes, if someone has Apache or a newer IIS installed he could help.
>>> Create a file name with characters not in the current ANSI code page by
>>> copying those characters from the Windows application charmap.exe.
>>> Then start a packet sniffer and log a directory listing.
>> 
>> Using IIS6 on W2K3.
> 
> Thanks!
> 
>> The twothird character (U+2154) is sent in the dirlist as 3
>> characters : 0xE2 0x85 0x94. In the href link, the 3 characters are
>> expressed as %e2%85%94 
> 
> That's UTF-8 URL-encoded.
> 
>> while they are binary in the text itself.
> 
> The twothird character is not 'encoded' either as "&#8532;" (decimal) or
> as "&#x2154;" (hex)? If so, IIS sends plain UTF-16! 
> 
>> There is nothing in the html header to tell which code page or
>> charset is used. --
> 
> Browsers seem to be very good at detecting the correct character set
> nowadays.
> 
> --
> Arno Garrels
> -- 
> To unsubscribe or change your settings for TWSocket mailing list
> please goto http://lists.elists.org/cgi-bin/mailman/listinfo/twsocket
> Visit our website at http://www.overbyte.be