https://issues.apache.org/bugzilla/show_bug.cgi?id=44724

           Summary: CGIServlet: undesirable encoding conversion of CGI
                    output
           Product: Tomcat 6
           Version: unspecified
          Platform: All
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Native:Integration
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


Created an attachment (id=21750)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=21750)
CGI script which outputs Russian text in UTF-8

Tomcat CGIServlet which is distributed with Tomcat performs undesirable
conversion of CGI script output encoding.

This conversion is performed indirectly by the servlet (which passes data
through a memory stream) and depends on character encoding environment
variables (LANG/LC_ALL/...), at least on UNIX-like systems.

Steps to reproduce:

1. Install Tomcat 
2. Create a CGI script which outputs non-ascii7 data.
3. Set up Tomcat for running CGI scripts (a la
http://tomcat.apache.org/tomcat-6.0-doc/cgi-howto.html)
4. Set up any single-byte language environment  (for example, say 'export
LANG=C' in bash)
5. Start Tomcat instance in that single-byte language environment
6. Make a request to CGI script.

All non-ascii7 characters in the CGI output will be replaced with quotation
marks.

This behaviour is different from any other CGI implementation and is unexpected
by any "normal" user.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to