>In this case, the encoding is almost certainly "latin-1". I know that >from playing around at the interactive interpreter, like this: > > >>> s = 'M\xc9XICO' > >>> print s.decode('latin-1') > MÉXICO > >If you want to see charts of various encodings, wikipedia has a bunch. > For instance, the Latin-1 encoding is here: >http://en.wikipedia.org/wiki/ISO/IEC_8859-1 and UTF-8 is here: >http://en.wikipedia.org/wiki/Utf-8
Yep, it is. Thanks those charts are exactly what I wanted! Now I have another question. What is the difference between what print shows and what the interpreter shows? >>> print s.decode('latin-1') MÉXICO >>> s.decode('latin-1') u'M\xc9XICO' >>> print repr(s) 'M\xc9XICO' >>> repr(s) "'M\\xc9XICO'" Ramit Ramit Prasad | JPMorgan Chase Investment Bank | Currencies Technology 712 Main Street | Houston, TX 77002 work phone: 713 - 216 - 5423 This email is confidential and subject to important disclaimers and conditions including on offers for the purchase or sale of securities, accuracy and completeness of information, viruses, confidentiality, legal privilege, and legal entity disclaimers, available at http://www.jpmorgan.com/pages/disclosures/email. _______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor