https://issues.apache.org/bugzilla/show_bug.cgi?id=45957
Summary: UTF-8 POST request results in corrupted data
Product: Tomcat 5
Version: 5.5.27
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: Connector:HTTP
AssignedTo: dev@tomcat.apache.org
ReportedBy: [EMAIL PROTECTED]

Currently in Tomcat 5, if a POST request is received containing UTF-8 content, any accented or non-Roman characters are corrupted, since there is an assumption that the POST body is ISO-8859-1 (Latin-1). For example, 'é' becomes 'Ã©'.

A workaround is described on the following page, in the form of a Filter class called 'CharsetFilter':

http://wiki.apache.org/tomcat/Tomcat/UTF-8

IMHO, this is not an ideal solution for a site that wants to standardise on UTF-8.

It should be noted that the same document describes an option to specify the encoding of the URI. It would be handy to have something equivalent for the POST content, maybe something like "POSTencoding", defaulting to ISO-8859-1 as is used currently, together with the necessary code to use UTF-8 instead.

This issue is valid for both Tomcat 5 and 6.

Additional keywords: internationalisation, internationalization, i18n
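
The corruption described in this report can be reproduced outside Tomcat with plain Java. The following is only a minimal sketch of the mechanism, not Tomcat's actual code: the class name PostEncodingDemo and the method decode are hypothetical, and the byte array stands in for a POST body sent by a browser as UTF-8.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; it only illustrates the decoding mismatch.
public class PostEncodingDemo {

    // Decodes raw request-body bytes using the given charset.
    public static String decode(byte[] body, Charset charset) {
        return new String(body, charset);
    }

    public static void main(String[] args) {
        // U+00E9 is 'e' with an acute accent; UTF-8 encodes it as 0xC3 0xA9.
        String original = "\u00e9";
        byte[] body = original.getBytes(StandardCharsets.UTF_8);

        // Decoding those two bytes as ISO-8859-1 yields two characters,
        // U+00C3 followed by U+00A9, which is the corruption in the report.
        String wrong = decode(body, StandardCharsets.ISO_8859_1);

        // Decoding with the charset the client actually used restores the text.
        String right = decode(body, StandardCharsets.UTF_8);

        System.out.println("bytes in body:     " + body.length);
        System.out.println("ISO-8859-1 length: " + wrong.length());
        System.out.println("UTF-8 length:      " + right.length());
        System.out.println("round trip ok:     " + right.equals(original));
    }
}
```

In a servlet, the usual way to get the second behaviour is to call ServletRequest.setCharacterEncoding("UTF-8") before any request parameter is read, which is presumably what a filter such as the 'CharsetFilter' mentioned above does for every request.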