https://issues.apache.org/bugzilla/show_bug.cgi?id=44724
Summary: CGIServlet: undesirable encoding conversion of CGI
output
Product: Tomcat 6
Version: unspecified
Platform: All
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: Native:Integration
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]
Created an attachment (id=21750)
--> (https://issues.apache.org/bugzilla/attachment.cgi?id=21750)
CGI script which outputs Russian text in UTF-8
Tomcat CGIServlet which is distributed with Tomcat performs undesirable
conversion of CGI script output encoding.
This conversion is performed indirectly by the servlet (which passes data
through a memory stream) and depends on character encoding environment
variables (LANG/LC_ALL/...), at least on UNIX-like systems.
Steps to reproduce:
1. Install Tomcat
2. Create a CGI script which outputs non-ascii7 data.
3. Set up Tomcat for running CGI scripts (a la
http://tomcat.apache.org/tomcat-6.0-doc/cgi-howto.html)
4. Set up any single-byte language environment (for example, say 'export
LANG=C' in bash)
5. Start Tomcat instance in that single-byte language environment
6. Make a request to CGI script.
All non-ascii7 characters in the CGI output will be replaced with quotation
marks.
This behaviour is different from any other CGI implementation and is unexpected
by any "normal" user.
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]