[issue4153] Unicode HOWTO up to date?

2010-11-18 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: r82301 appears to be a blind merge of r82120 from the trunk. It is fairly obvious that it was not intentional. -- ___ Python tracker <http://bugs.python.org/issue4

[issue4153] Unicode HOWTO up to date?

2010-11-18 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +akuchling ___ Python tracker <http://bugs.python.org/issue4153> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue7828] chr() and ord() documentation for wide characters

2010-11-18 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- keywords: +patch stage: needs patch -> patch review Added file: http://bugs.python.org/file19633/issue7828.diff ___ Python tracker <http://bugs.python.org/iss

[issue7828] chr() and ord() documentation for wide characters

2010-11-18 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Committed in r86526 (3.2) and r86527 (3.1). -- resolution: -> fixed stage: patch review -> committed/rejected status: open -> closed ___ Python tracker <http://bugs.python.o

[issue4153] Unicode HOWTO up to date?

2010-11-18 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Thu, Nov 18, 2010 at 2:41 PM, Terry J. Reedy wrote: .. > I visually parse 0-1,114,111 as 0-1, 114, 111. So I think either the commas > should be removed or extra spaces are needed: 0-1114111 or 0 - 1,114,111. What about "0 through 1,114,1

[issue4153] Unicode HOWTO up to date?

2010-11-18 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Thu, Nov 18, 2010 at 3:00 PM, Alexander Belopolsky wrote: .. >> I really think of them as hex or hexadecimal digits, just as 0-9 are >> decimal, not base 10 digits. >> > > I am fine with "hexadecimal" here.  I did n

[issue2001] Pydoc interactive browsing enhancement

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Thu, Nov 18, 2010 at 2:37 AM, Ron Adam wrote: .. > I'll try reading and writing directly to the socket and working up some tests > from that. > I don't suppose there's something like that already in the test suite I can &g

[issue2001] Pydoc interactive browsing enhancement

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: issue2001_b.diff patch includes changes to urllib. Is this intentional? Is it a bug fix, a feature? There is no mention in the NEWS file. If these changes are needed for pydoc enhancements, I would like to separate them in its own issue and commit

[issue2001] Pydoc interactive browsing enhancement

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: issue2001_c.diff is the same as issue2001_b.diff, but without urlparse changes and with minor modifications to pydoc.rst resolving a conflict with a recent commit. I have also uploaded the same patch to rietveld: http://codereview.appspot.com/3187042

[issue4113] Add custom __repr__ to functools.partial

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: There is an ongoing discussion about deprecating undocumented PyUnicode_AppendAndDel(). See Marc-Andre's comment in msg121371: """ +.. c:function:: void PyUnicode_Append(PyObject **pleft, PyObject *right) + + Concat two strings a

[issue4153] Unicode HOWTO up to date?

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Committed in revision 86530. Thanks Terry and Raymond for your comments. I would like to keep this issue open (at a low priority) because the question in the titles is still relevant. There are many new 3.x features that are not covered such as

[issue10461] Use with statement throughout the docs

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: +1 BTW, I've updated examples in Unicode HOWTO to use with. -- nosy: +belopolsky ___ Python tracker <http://bugs.python.org/is

[issue9769] PyUnicode_FromFormatV() doesn't handle non-ascii text correctly

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: I don't understand Victor's argument in msg115889. According to UTF-8 RFC, <http://www.ietf.org/rfc/rfc2279.txt>: - US-ASCII values do not appear otherwise in a UTF-8 encoded character stream. This provides compatibility w

[issue9769] PyUnicode_FromFormatV() doesn't handle non-ascii text correctly

2010-11-19 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 19, 2010 at 3:06 PM, STINNER Victor wrote: > .. Whereas PyUnicode_FromFormatV() converts the format string > (bytes) to unicode (characters). If you would like a comparaison in C, it's > like printf()+mbstowcs() in the same func

[issue9769] PyUnicode_FromFormatV() doesn't handle non-ascii text correctly

2010-11-20 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 19, 2010 at 7:15 PM, STINNER Victor wrote: .. > > Why should we do that? ASCII format is just fine. Remember that > PyUnicode_FromFormatV() is part of the C API. I don't think that anyone would > use non-ASCII format in C.

[issue10371] Deprecate trace module undocumented API

2010-11-20 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Committed in revision 86594. -- resolution: -> accepted stage: patch review -> committed/rejected status: open -> closed ___ Python tracker <http://bugs.python.or

[issue10461] Use with statement throughout the docs

2010-11-20 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: FWIW, I find the "with" variant much easier to read than >>> words = re.findall('\w+', open('hamlet.txt').read().lower()) Too many operations in one line. It would be even better with >>

[issue10461] Use with statement throughout the docs

2010-11-20 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- Removed message: http://bugs.python.org/msg121711 ___ Python tracker <http://bugs.python.org/issue10461> ___ ___ Python-bug

[issue1646068] Dict lookups fail if sizeof(Py_ssize_t) < sizeof(long)

2010-11-20 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- status: pending -> closed ___ Python tracker <http://bugs.python.org/issue1646068> ___ ___ Python-bugs-list mailing list Un

[issue9305] Don't use east/west of UTC in date/time documentation

2010-11-20 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: LGTM -- ___ Python tracker <http://bugs.python.org/issue9305> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue9305] Don't use east/west of UTC in date/time documentation

2010-11-20 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- assignee: d...@python -> belopolsky stage: needs patch -> commit review ___ Python tracker <http://bugs.python.org/

[issue9305] Don't use east/west of UTC in date/time documentation

2010-11-20 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On the second reading, I have a few issues with the patch. 1. Please run makepatcheck. There are whitespace issues in datetime.rst. 2. In docstrings, you remove the information about the sign. I would not mind leaving docstrings the way they are

[issue10474] range.count returns boolean

2010-11-20 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: SilentGhost, Benjamin's fix only affect the optimized path when the argument is int or bool. Exotic comparison rules are handled in the else clause. -- nosy: +belopolsky ___ Python tracker

[issue8646] PyUnicode_EncodeDecimal is undocumented

2010-11-20 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- resolution: -> duplicate status: open -> closed superseder: -> Document unicode C-API in reST ___ Python tracker <http://bugs.python.o

[issue8645] PyUnicode_AsEncodedObject is undocumented

2010-11-20 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- resolution: -> duplicate status: open -> closed superseder: -> Document unicode C-API in reST ___ Python tracker <http://bugs.python.o

[issue8647] PyUnicode_GetMax is undocumented

2010-11-20 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- resolution: -> duplicate superseder: -> Document unicode C-API in reST ___ Python tracker <http://bugs.python.org/

[issue10427] 24:00 Hour in DateTime

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 21, 2010 at 1:24 PM, Rodrigo Bernardo Pimentel wrote: .. > I was writing tests for this issue, when something struck me: ok, > datetime(year, month, day, 24) is valid. > But is datetime(year, month, day, 24, 1) valid? Or datetime(ye

[issue10427] 24:00 Hour in DateTime

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 21, 2010 at 3:48 PM, Mark Dickinson wrote: .. > Well, if all that's wanted is for hour==24 to be legal on input, with the > datetime object > itself being automatically normalized at creation time, then the choices seem >

[issue10493] test_strptime failures under OpenIndiana

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: AssertionError: strftime does not support standard '%d' format (day of month as number (00-31)) Sounds pretty self-explainatory. I kind of doubt, however, that OpenIndiana's strftime is that broken. Jesús,

[issue10494] Demo/comparisons/regextest.py needs some usage information.

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: +0 Not "+1" because Georg suggested that this demo be removed. See http://mail.python.org/pipermail/python-ideas/2010-October/008416.html On the patch, I would define a "usage" variable near the top of the program instead of repe

[issue8525] Small enhancement to help()

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: The following passes tests in elp_8525.patch, but is much simpler: === --- Lib/pydoc.py(revision 86600) +++ Lib/pydoc.py(working copy) @@ -1139,6 +1139,15

[issue10138] calendar module does not support years outside [1, 9999] range

2010-11-21 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +rhettinger ___ Python tracker <http://bugs.python.org/issue10138> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue10087] HTML calendar is broken

2010-11-21 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +rhettinger ___ Python tracker <http://bugs.python.org/issue10087> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue10498] calendar.LocaleHTMLCalendar.formatyearpage() results in traceback with 'unsupported locale setting' on Windows

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: See also #10087. -- nosy: +belopolsky ___ Python tracker <http://bugs.python.org/issue10498> ___ ___ Python-bugs-list m

[issue10497] Incorrect use of gettext in argparse

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: The patch makes sense. I wonder if i10n of programs using argparse should be mentioned in docs. Providing a sample translation file somewhere may not be a bad idea at least for testing. -- nosy: +belopolsky

[issue10138] calendar module does not support years outside [1, 9999] range

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Fixed in revision 86669. -- resolution: -> fixed stage: -> committed/rejected status: open -> closed ___ Python tracker <http://bugs.python.or

[issue10087] HTML calendar is broken

2010-11-21 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: It may be acceptable, but your patch elides the line breaks in output. Can you add unit tests? -- ___ Python tracker <http://bugs.python.org/issue10

[issue10087] HTML calendar is broken

2010-11-22 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Mon, Nov 22, 2010 at 9:48 AM, Chris Lambacher wrote: .. > I don't understand what you mean by "elides the line breaks in output". It is actually not that bad: $ ./python.exe -m calendar -t html| wc -l 121 $ python2.7 -m calen

[issue10087] HTML calendar is broken

2010-11-22 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Mon, Nov 22, 2010 at 11:55 AM, Chris Lambacher wrote: .. > 1. Any suggestions about how to test the output of the console program (the > case that this bug affects) > would be appreciated. For month displays, a doctest may be most appropr

[issue10435] Document unicode C-API in reST

2010-11-22 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Wed, Nov 17, 2010 at 5:20 PM, Marc-Andre Lemburg wrote: .. > -/* Encodes a Unicode object and returns the result as Python string > +/* Encodes a Unicode object and returns the result as Python bytes >    object. */ > > > PyUnico

[issue6878] changed return type from tkinter.Canvas.coords

2010-11-22 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Committed in r86697 (3.2) and r86698 (3.1). @Éric: I tried to minimize whitespace changes in the commit, but if you see indentation that can be improved, let me know and I'll fix it separately. @SilentGhost: Yes, the problem is real. For ex

[issue6878] changed return type from tkinter.Canvas.coords

2010-11-22 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- resolution: -> fixed stage: patch review -> committed/rejected status: open -> closed ___ Python tracker <http://bugs.python.o

[issue10087] HTML calendar is broken

2010-11-22 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: I wonder: do we really need "encoding" option to the calendar output? It is really not the job of the calendar module to deal with encodings. The calendar module should produce text calendars as unicode strings and HTML calendars as UTF-8

[issue8525] Display exception's subclasses in help()

2010-11-22 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: The proposal is to display builtin subclasses as for example: >>> help(ArithmeticError) class ArithmeticError(Exception) | Base class for arithmetic errors. | | Method resolution order: | ArithmeticError |

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
New submission from Alexander Belopolsky : >>> 'xyz'.center(20, '\U00100140') Traceback (most recent call last): File "", line 1, in TypeError: The fill character must be exactly one character long str.ljust and str.rjust are similarly affected.

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Wed, Nov 24, 2010 at 10:33 AM, Antoine Pitrou wrote: .. > The question is, what should it do with such an input? I think the rule for such functions should be that if input.encode('utf-8') is the same on wide and narrow builds, then the

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Wed, Nov 24, 2010 at 3:37 PM, Marc-Andre Lemburg wrote: .. > I don't think we should change that for the formatting methods. That's a reasonable position. What about 'Lo' >>> '\N{OLD ITALIC LETTER A}'.is

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Wed, Nov 24, 2010 at 3:37 PM, Marc-Andre Lemburg wrote: .. > I don't think we should change that for the formatting methods. That's a reasonable position. What about >>> unicodedata.category('\N{OLD ITALIC LETTER A}'

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- Removed message: http://bugs.python.org/msg122313 ___ Python tracker <http://bugs.python.org/issue10521> ___ ___ Python-bug

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Here is another str method not ready for non-BMP chars: >>> u = '\U00010140' >>> u.translate({ord(u):ord('A')}) '𐅀' (expected 'A') >>> u = 'B' >>> u.translate({ord

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-24 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Here is another proof of concept patch for the isalpha issue that introduces a higher level abstraction macro - Py_UNICODE_NEXT. It should be possible to reuse this macro in all isxyz methods and other places where surrogates are currently processed

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
New submission from Alexander Belopolsky : As discussed in issue 10521 and the sprawling "len(chr(i)) = 2?" thread [1] on python-dev, many functions in python library behave differently on narrow and wide builds. While there are unavoidable differences such as the length of string

[issue9200] str.isprintable() is always False for large code points

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: AFAICT, all ctype methods (isalpha, isdigit, etc.) have the same problem. I posted a patch at issue10542 that introduces a Py_UNICODE_NEXT() macro that can help fixing all these methods. I am adding #10542 as a dependency and if there are no

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +haypo, loewis ___ Python tracker <http://bugs.python.org/issue10542> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue10521] str methods don't accept non-BMP fillchar on a narrow Unicode build

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 26, 2010 at 6:37 PM, Terry J. Reedy wrote: > > Terry J. Reedy added the comment: > > As a practical matter, I think that for at least the next decade, people are > at least as likely to > want to fill with a composed, m

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 26, 2010 at 7:27 PM, Eric Smith wrote: .. > > In addition to the proposed Py_UNICODE_NEXT and Py_UNICODE_PUT_NEXT, > > str.__format__ would also need a function that tells it how many Py_UNICODEs > are needed to store a give

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 26, 2010 at 7:45 PM, Eric Smith wrote: .. > For my use I'd really need it to take the result of Py_UNICODE_NEXT. > Something like: > Py_ssize_t > Py_UNICODE_NUM_NEEDED(Py_UCS4 c) > and it would always return 1 or 2. Alway

[issue9200] str.isprintable() is always False for large code points

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Attached patch fixes isprintable and other ctype-like methods. I left isspace() out for now because I could not find a test character outside of BMP to test with, but I think we should fix that for completeness as well. At this point the goal is

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 26, 2010 at 8:41 PM, STINNER Victor wrote: .. > I don't like macro having a result and using multiple instructions using the > evil > magic trick (the ","). It's harder to maintain the code and harder to debug

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Fri, Nov 26, 2010 at 9:22 PM, Eric Smith wrote: .. > But I definitely agree that we should get the abstraction right first and > worry about > the implementation later. I am fairly happy with Py_UNICODE_NEXT() abstraction. It's seman

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-26 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Raymond, I wonder if you would like to comment on the iterator analogy and/or on adding public names to C API. -- nosy: +rhettinger ___ Python tracker <http://bugs.python.org/issue10

[issue10541] regrtest.py -T broken

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Apparently something in the test changes the cwd. Both of the following invocations work: $ ./python.exe `pwd`/Lib/test/regrtest.py -T -N test_urllib $ ./python.exe -m test.regrtest -T -N test_urllib I would suggest changing the coverage target in the

[issue10549] help(cls1) breaks when cls1 has staticmethod(cls2) attribute

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Confirmed in py3k. -- assignee: -> belopolsky nosy: +belopolsky stage: -> needs patch type: -> behavior ___ Python tracker <http://bugs.python.or

[issue10549] help(cls1) breaks when cls1 has staticmethod(cls2) attribute

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: The fix is simple: --- Lib/pydoc.py(revision 86824) +++ Lib/pydoc.py(working copy) @@ -1110,7 +1110,7 @@ result = result + self.section('FILE', file) return result -def docclass(self, object, name=None

[issue10541] regrtest.py -T broken

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: I would like to investigate this some more. In theory, regrtest should restore cwd before coverage results are written. -- status: pending -> open ___ Python tracker <http://bugs.python.org/issu

[issue10552] Tools/unicode/gencodec.py error

2010-11-27 Thread Alexander Belopolsky
New submission from Alexander Belopolsky : $ ../../python.exe gencodec.py MAPPINGS/VENDORS/MISC/ build/ converting APL-ISO-IR-68.TXT to build/apl_iso_ir_68.py and build/apl_iso_ir_68.mapping converting ATARIST.TXT to build/atarist.py and build/atarist.mapping converting CP1006.TXT to build

[issue7962] Demo and Tools need to be tested and pruned

2010-11-27 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- dependencies: +Tools/unicode/gencodec.py error ___ Python tracker <http://bugs.python.org/issue7962> ___ ___ Python-bug

[issue10552] Tools/unicode/gencodec.py error

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Attached patch addresses the issue by using -1 instead of None for missing codes. Comparison of generated encoding files to those in Lib/encodings shows only whitespace changes except one which appears to be a change on the unicode.org side: diff -b

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sat, Nov 27, 2010 at 5:03 PM, Marc-Andre Lemburg wrote: .. [I'll respond to skipped when I update the patch] > In any case, we should clearly document where these macros are used and > warn about the implications of using them in the w

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sat, Nov 27, 2010 at 5:03 PM, Marc-Andre Lemburg wrote: .. >  * same for the Py_UNICODE_NEXT() macro, i.e. Py_UCS4_NEXT() > >  * in order to make the macro easier to understand, please rename it to > Py_UCS4_READ_CODE_POINT(); that

[issue10552] Tools/unicode/gencodec.py error

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Attached patch uses MISSING_CODE as Mark suggested. There are still errors apparently because parsecodes() may return either an int or a tuple. I think only mac encodings are affected, so I would like to commit the current patch before tackling this

[issue10552] Tools/unicode/gencodec.py error

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Please ignore Makefile changes in the patch. -- ___ Python tracker <http://bugs.python.org/issue10552> ___ ___ Python-bug

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sat, Nov 27, 2010 at 5:41 PM, Ezio Melotti wrote: > > Ezio Melotti added the comment: > >> * the Py_UNICODE_JOIN_SURROGATES() macro should use Py_UCS4 as prefix since >> it returns Py_UCS4 values, i.e. Py_UCS4_JOIN_SURROGATE

[issue10542] Py_UNICODE_NEXT and other macros for surrogates

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: I am attaching a patch that defines Py_UNICODE_PUT_NEXT() macro (tentative name) and uses it to fix str.upper method. The implementation of surrogate-aware str.upper shows that NEXT/PUT_NEXT abstractions may lead to somewhat inefficient code for &qu

[issue10557] Malformed error message from float()

2010-11-27 Thread Alexander Belopolsky
New submission from Alexander Belopolsky : >>> float('½') Traceback (most recent call last): File "", line 1, in ValueError: could not convert string to float: � >>> float('42½') Traceback (most recent call last): File "", line 1,

[issue10318] "make altinstall" installs many files with incorrect shebangs

2010-11-27 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: -#!/usr/bin/env python +#!/usr/bin/env python3 # -*- coding: UTF-8 -*- Not strictly related to this issue, but do we want to recommend redundant encoding cookie in the docs? -- ___ Python tracker

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 11:37 AM, Mark Dickinson wrote: > > Mark Dickinson added the comment: > >> I am not sure, whether support for non-ascii digits in float() >> constructor is worth maintaining. > > I'd be very hap

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 12:00 PM, Mark Dickinson wrote: > > Mark Dickinson added the comment: > > About Alexander's solution:  might it make more sense to have > PyUnicode_EncodeDecimal raise > for inputs like this? No, I thin

[issue10567] Unicode space character \u200b unrecognised a space

2010-11-28 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +belopolsky ___ Python tracker <http://bugs.python.org/issue10567> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue10567] Unicode space character \u200b unrecognised a space

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: The category of U-200B was changed in Unicode 4.0.1: """ The main new features in Unicode 4.0.1 are the following: ... * Changed: general category of U+200B ZERO WIDTH SPACE """ http://unicode.org/versions/Unicod

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Issue #10567 demonstrated the problem of relying on the Unicode database in Python builtins. Apparently, Unicode does not guarantee stability of the character categories. On the other hand, we are already tied to UCD for the language definition. Maybe

[issue10567] Unicode space character \u200b unrecognised a space

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 2:07 PM, Marc-Andre Lemburg wrote: .. > The tables were never manually maintained, but we also did not update > Python for each new Unicode version: > > Python 1.6: Unicode 3.0 > Python 2.0: Unicode 3.0 > Pytho

[issue10567] Unicode space character \u200b unrecognised a space

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 2:40 PM, Marc-Andre Lemburg wrote: .. > Going back further shows the change: > > 3.0.1: 200B;ZERO WIDTH SPACE;Zs;0;BN;N; > 3.2.0: 200B;ZERO WIDTH SPACE;Zs;0;BN;N; > 4.0.1: 200B;ZERO WIDTH SPAC

[issue10568] Python and the Unicode Character Database

2010-11-28 Thread Alexander Belopolsky
New submission from Alexander Belopolsky : Two recently reported issues brought into light the fact that Python language definition is closely tied to character properties maintained by the Unicode Consortium. [1,2] For example, when Python switches to Unicode 6.0.0 (planned for the upcoming

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 3:09 PM, Stefan Krah wrote: .. > Decimal point: U+066B Well, not so fast: Traceback (most recent call last): File "", line 1, in UnicodeEncodeError: 'decimal' codec can't encode character &#x

[issue10568] Python and the Unicode Character Database

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: This was meant as python-dev post, not an issue. (Sent to wrong address by mistake.) -- nosy: +belopolsky resolution: -> invalid status: open -> closed ___ Python tracker <http://bugs.python.org/i

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 3:30 PM, Stefan Krah wrote: .. >> UnicodeEncodeError: 'decimal' codec can't encode character '\u066b' > > Hmm, looks like a bug? I think U+066B is correct. > Really? What about Traceback

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Sending this by e-mail was not a good idea ... On Sun, Nov 28, 2010 at 3:30 PM, Stefan Krah wrote: .. >> UnicodeEncodeError: 'decimal' codec can't encode character '\u066b' > > Hmm, looks like a bug? I think U+0

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- Removed message: http://bugs.python.org/msg122725 ___ Python tracker <http://bugs.python.org/issue10557> ___ ___ Python-bug

[issue10562] Change 'j' for imaginary unit into an 'i'

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Why not allow complex('1i')? (Tongue in cheek: I am really looking for arguments against lax parsing in number builtins. See issue 10557.) -- nosy: +belopolsky ___ Python tracker <http://bu

[issue10562] Change 'j' for imaginary unit into an 'i'

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Sun, Nov 28, 2010 at 3:58 PM, Mark Dickinson wrote: .. > I'd expect to allow complex('3 + 4i') as well. And with spaces surrounding '+' too. -- ___ Python tracker <ht

[issue10563] Spurious newline in time.ctime

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: This looks like a bug in the underlying platform. POSIX requires [1] that the output of ctime() fits in a 26-character buffer. Note that a change has been recently made to time.asctime() to reject year > . See r85137 and issue6608. I think

[issue10563] Spurious newline in time.ctime

2010-11-28 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- assignee: -> belopolsky stage: unit test needed -> needs patch type: -> feature request versions: +Python 3.2 -Python 2.6, Python 2.7, Python 3.1 ___ Python tracker <http://bugs.python.or

[issue10544] yield expression inside generator expression does nothing

2010-11-28 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +belopolsky ___ Python tracker <http://bugs.python.org/issue10544> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue10544] yield expression inside generator expression does nothing

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: Isn't this the same issue as #3267? -- ___ Python tracker <http://bugs.python.org/issue10544> ___ ___ Python-bugs-l

[issue10557] Malformed error message from float()

2010-11-28 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: After a bit of svn archeology, it does appear that Arabic-Indic digits' support was deliberate at least in the sense that the feature was tested for when the code was first committed. See r15000. The test migrated from file to file over the la

[issue10557] Malformed error message from float()

2010-11-29 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: On Mon, Nov 29, 2010 at 4:41 AM, Marc-Andre Lemburg wrote: .. > It would be better to copy and iterate over the Unicode string first, > replacing any decimal code points with ASCII ones and then call the > UTF-8 encoder. > Good idea. >

[issue10572] Move unittest test package to Lib/test

2010-11-29 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: +0, and I think we should hear from the maintainers of the affected packages first. For packages that are also externally maintained moving tests out may cause inconvenience to the maintainer. -- nosy: +barry, belopolsky, brett.cannon

[issue10572] Move unittest test package to Lib/test

2010-11-29 Thread Alexander Belopolsky
Changes by Alexander Belopolsky : -- nosy: +gpolo ___ Python tracker <http://bugs.python.org/issue10572> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue10575] makeunicodedata.py does not support Unihan digit data

2010-11-29 Thread Alexander Belopolsky
Alexander Belopolsky added the comment: I am adding #10552 as a dependency because I think we should fix unicode data generation in 3.x before adding new features to the scripts. I am also not sure whether this is a bug or a feature request. Martin? -- dependencies: +Tools/unicode

<    1   2   3   4   5   6   7   8   9   10   >