https://issues.apache.org/bugzilla/show_bug.cgi?id=44724
           Summary: CGIServlet: undesirable encoding conversion of CGI output
           Product: Tomcat 6
           Version: unspecified
          Platform: All
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Native:Integration
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


Created an attachment (id=21750)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=21750)
CGI script which outputs Russian text in UTF-8

The CGIServlet distributed with Tomcat performs an undesirable conversion of
the character encoding of CGI script output. The servlet performs this
conversion indirectly (it passes the data through a memory stream), and the
result depends on the locale environment variables (LANG/LC_ALL/...), at least
on UNIX-like systems.

Steps to reproduce:
1. Install Tomcat.
2. Create a CGI script which outputs non-ASCII data (bytes outside 7-bit
   ASCII).
3. Set up Tomcat for running CGI scripts (as described in
   http://tomcat.apache.org/tomcat-6.0-doc/cgi-howto.html).
4. Set up any single-byte language environment (for example, run
   'export LANG=C' in bash).
5. Start the Tomcat instance in that single-byte language environment.
6. Make a request to the CGI script.

All non-ASCII characters in the CGI output are replaced with question marks.

This behaviour differs from every other CGI implementation and is not what a
typical user would expect.
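
The failure mode described in the report can be illustrated with a minimal Java sketch. This is not the CGIServlet source. The class name CharsetRoundTripDemo and both helper methods are hypothetical, and US-ASCII is only assumed to stand in for the platform default charset a JVM might select under LANG=C. It shows that decoding UTF-8 bytes into characters with a 7-bit charset and encoding them again loses the data, while a byte-for-byte copy does not.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical demo class; not part of Tomcat.
class CharsetRoundTripDemo {

    // Lossy path: bytes -> characters -> bytes, using the given charset
    // (assumed here to play the role of the platform default charset).
    static byte[] throughCharacters(byte[] cgiOutput, Charset platformCharset) {
        String decoded = new String(cgiOutput, platformCharset);
        return decoded.getBytes(platformCharset);
    }

    // Safe path: the output is treated as opaque bytes and copied unchanged.
    static byte[] throughBytes(byte[] cgiOutput) {
        return Arrays.copyOf(cgiOutput, cgiOutput.length);
    }

    public static void main(String[] args) {
        // Russian text written with Unicode escapes so the source file
        // encoding does not matter, then encoded as UTF-8 like the CGI output.
        byte[] utf8 = "\u041f\u0440\u0438\u0432\u0435\u0442"
                .getBytes(StandardCharsets.UTF_8);

        byte[] lossy = throughCharacters(utf8, StandardCharsets.US_ASCII);
        byte[] safe = throughBytes(utf8);

        System.out.println("lossy equals original: " + Arrays.equals(utf8, lossy));
        System.out.println("safe equals original:  " + Arrays.equals(utf8, safe));
        System.out.println("lossy as ASCII text:   "
                + new String(lossy, StandardCharsets.US_ASCII));
    }
}
```

If the servlet's data path resembles throughCharacters, copying the script's body as bytes (as in throughBytes) would avoid the dependency on the locale. Whether that matches the actual code path in CGIServlet is an assumption of this sketch.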