https://issues.apache.org/bugzilla/show_bug.cgi?id=45957
           Summary: UTF-8 POST request results in corrupted data
           Product: Tomcat 5
           Version: 5.5.27
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Connector:HTTP
        AssignedTo: dev@tomcat.apache.org
        ReportedBy: [EMAIL PROTECTED]


Currently in Tomcat 5 if a request is received containing UTF-8 content then
any accents or non-Roman characters are corrupted, since there is an assumption
the POST request is ISO-8895-1 (latin1). For example 'é' becomes 'é'. A
work around is specified on the following page:

   http://wiki.apache.org/tomcat/Tomcat/UTF-8

in the form of a Filter class, called 'CharsetFilter'. IMHO, this is not the
most ideal solution for a site that is wanting to standardise on UTF-8.

It should be noted that in the same document, referenced above, there is an
option to specify the encoding of the URI. It would be handy to have something
equivalent for the POST content, maybe something like "POSTencoding",
defaulting to ISO-8859-1 as is used currently, and then providing the necessary
code to use UTF-8 instead.

This issue is valid for both Tomcat 5 and 6.

additional keywords: internationalisation, internationalization, i18n


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to