[issue9061] cgi.escape Can Lead To XSS Vulnerabilities

2010-06-23 Thread Craig Younkins

New submission from Craig Younkins :

The method in question: http://docs.python.org/library/cgi.html#cgi.escape
http://svn.python.org/view/python/tags/r265/Lib/cgi.py?view=markup   # at the 
bottom
http://code.python.org/hg/trunk/file/3be6ff1eebac/Lib/cgi.py#l1031

"Convert the characters '&', '<' and '>' in string s to HTML-safe sequences. 
Use this if you need to display text that might contain such characters in 
HTML. If the optional flag quote is true, the quotation mark character ('"') is 
also translated; this helps for inclusion in an HTML attribute value, as in . If the value to be quoted might include single- or double-quote 
characters, or both, consider using the quoteattr() function in the 
xml.sax.saxutils module instead."

cgi.escape never escapes single quote characters, which can easily lead to a 
Cross-Site Scripting (XSS) vulnerability. This seems to be known by many, but a 
quick search reveals many are using cgi.escape for HTML attribute escaping.

The intended use of this method is unclear to me. Up to and including Mako 
0.3.3, this method was the HTML escaping method. Used in this manner, 
single-quoted attributes with user-supplied data are easily susceptible to 
cross-site scripting vulnerabilities.

While the documentation says "if the value to be quoted might include single- 
or double-quote characters... [use the] xml.sax.saxutils module instead," it 
also implies that this method will make input safe for HTML. Because this 
method escapes 4 of the 5 key XML characters, it is reasonable to expect some 
will use it for HTML escaping.

I suggest rewording the documentation for the method making it more clear what 
it should and should not be used for. I would like to see the method changed to 
properly escape single-quotes, but if it is not changed, the documentation 
should explicitly say this method does not make input safe for inclusion in 
HTML.

This is definitely affecting the security of some Python web applications. I 
already mentioned Mako, but I've found this type of bug in other frameworks and 
engines because the creators either called cgi.escape directly or modeled their 
own after it.

Craig Younkins

--
assignee: d...@python
components: Documentation, Library (Lib)
messages: 108457
nosy: Craig.Younkins, d...@python
priority: normal
severity: normal
status: open
title: cgi.escape Can Lead To XSS Vulnerabilities
versions: Python 2.5, Python 2.6, Python 2.7, Python 3.1, Python 3.2, Python 3.3

___
Python tracker 
<http://bugs.python.org/issue9061>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue9061] cgi.escape Can Lead To XSS Vulnerabilities

2010-06-23 Thread Craig Younkins

Changes by Craig Younkins :


--
type:  -> security

___
Python tracker 
<http://bugs.python.org/issue9061>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue9061] cgi.escape Can Lead To XSS Vulnerabilities

2010-06-23 Thread Craig Younkins

Craig Younkins  added the comment:

Proof of concept:
print """""" % cgi.escape("' onload='alert(1);' bad='")

--

___
Python tracker 
<http://bugs.python.org/issue9061>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue9061] cgi.escape Can Lead To XSS Vulnerabilities

2010-06-23 Thread Craig Younkins

Craig Younkins  added the comment:

> cgi.escape is for HTML attribute escaping only.

It is not safe for HTML attribute escaping because it does not encode single 
quotes.

> "More suitable" for HTML would be the correct interpretation rather make the 
> "input safe".

"More suitable, but not quite secure"

Regardless of the intended use of this method, many many people are using it 
for insecure HTML entity escaping.

> you should explain or point out to resources where 
> 'single quotes' representation in a non-entity format 
> in a HTML page has lead to XSS.

print "" % cgi.escape("' onload='alert(1);' bad='")

> The very next paragraph seems to address the security considerations
> while using the cgi module itself, rather than limiting it to
> cgi.escape. It says that:
> "To be on the safe side, if you must pass a string gotten from a form
> to a shell command, you should make sure the string contains only
> alphanumeric characters, dashes, underscores, and periods."

The security concerns related to output on the web are very different from the 
concerns related sending user input to a shell command. The needed escaping is 
completely different. Also, the security advice above is woefully inadequate. 

> Any doc change suggestions you propose?

Convert the characters '&', '<' and '>' in string s to their HTML entity 
encoded values. If the optional flag quote is true, the double-quotation mark 
character ('"') is also encoded. Note that the output of this method is not 
safe to put in an HTML attribute because it does not escape single quotes. If 
the value to be quoted might include single- or double-quote characters, or 
both, consider using the quoteattr() function in the xml.sax.saxutils module 
instead.

> If cgi.escape needs to escape single quotes, what should it be as:
> lsquo/rsquo (for XHTML) and ' or ' for Others?

Sorry, I should have included that in the OP. It should escape to ' 
It is also advised to escape the forward slash character ('/') to /

See OWASP.org for an explanation of the complexities of the escaping:
http://www.owasp.org/index.php/XSS_(Cross_Site_Scripting)_Prevention_Cheat_Sheet#RULE_.231_-_HTML_Escape_Before_Inserting_Untrusted_Data_into_HTML_Element_Content

--

___
Python tracker 
<http://bugs.python.org/issue9061>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com