Francois PIETTE wrote:
>> The twothird character is not 'encoded' either as "&#8532;"
>> (decimal) or as "&#x2154;" (hex)? If so, IIS sends plain UTF-16!
> 
> Yes, no encoding at all. Just the 3 bytes. So UTF-16.

But 3 bytes look like UTF-8?
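
For reference, a quick way to check this outside of ICS. The following is only a minimal Python sketch, not ICS or IIS code, and the variable names are made up for illustration. It encodes U+2154 both ways and also builds the URL-encoded and numeric-entity forms mentioned in the thread:

```python
from urllib.parse import quote

ch = "\u2154"  # VULGAR FRACTION TWO THIRDS, the character from the dirlist

utf8_bytes = ch.encode("utf-8")       # variable-length encoding
utf16_bytes = ch.encode("utf-16-be")  # one 16-bit code unit, no BOM

# Hex dump of both encodings
utf8_hex = " ".join(f"{b:02X}" for b in utf8_bytes)
utf16_hex = " ".join(f"{b:02X}" for b in utf16_bytes)

# quote() percent-encodes the UTF-8 bytes by default (uppercase hex digits)
url_form = quote(ch)

# HTML numeric character references, decimal and hex
entity_dec = f"&#{ord(ch)};"
entity_hex = f"&#x{ord(ch):X};"

print("UTF-8 :", utf8_hex)
print("UTF-16:", utf16_hex)
print("URL   :", url_form)
print("HTML  :", entity_dec, entity_hex)
```

The UTF-8 form is three bytes and matches what the sniffer showed, while UTF-16 would need only two bytes for this character.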

--
Arno Garrels

> 
> --
> [EMAIL PROTECTED]
> http://www.overbyte.be
> 
> 
> ----- Original Message -----
> From: "Arno Garrels" <[EMAIL PROTECTED]>
> To: "ICS support mailing" <[email protected]>
> Sent: Thursday, October 09, 2008 5:26 PM
> Subject: Re: [twsocket] HTML encoding in HttpSrv func. TextToHtmlText()
> 
> 
>> Francois Piette wrote:
>>>> Yes, if someone has Apache or a newer IIS installed he could help.
>>>> Create a file name with characters not in the current ANSI code page
>>>> by copying those characters from the Windows application charmap.exe.
>>>> Then start a packet sniffer and log a directory listing.
>>> 
>>> Using IIS6 on W2K3.
>> 
>> Thanks!
>> 
>>> The twothird character (U+2154) is sent in the dirlist as 3
>>> bytes: 0xE2 0x85 0x94. In the href link, the 3 bytes are
>>> expressed as %e2%85%94
>> 
>> That's UTF-8 URL-encoded.
>> 
>>> while they are binary in the text itself.
>> 
>> The twothird character is not 'encoded' either as "&#8532;"
>> (decimal) or as "&#x2154;" (hex)? If so, IIS sends plain UTF-16!
>> 
>>> There is nothing in the HTML header to tell which code page or
>>> charset is used.
>> 
>> Browsers seem to be very good at detecting the correct character set
>> nowadays.
>> 
>> --
>> Arno Garrels
>> --
>> To unsubscribe or change your settings for TWSocket mailing list
>> please goto http://lists.elists.org/cgi-bin/mailman/listinfo/twsocket
>> Visit our website at http://www.overbyte.be
