On Tue, Jun 15, 2010 at 09:20:31PM +0200, sean finney wrote:
> Package: python-debian
> Version: 0.1.16
> Severity: important
> 
> I was updating the codebase for the debian patch tracker, and have stumbled
> across what i believe is a regression.  Now that python-debian uses unicode
> internally (since 0.1.15 it seems), if a Sources file contains both utf-8
> and latin-1 encoded maintainer names (like the etch Sources file does),
> then it seems impossible to produce output from the resulting Sources 
> instance.

Ah, yuck. :(

I can think of two possible solutions:

  * Accept 'raw' as a Deb822 constructor encoding argument, or add a
    raw_strings keyword argument, that turns off the unicode behavior
    - Con: old code still breaks with mixed data - you have to change
      your code to use the new constructor argument
    - Pro: most consistent results (raw strings are only returned if you
      explicitly ask for them)
  
  * Wrap the unicode decoding in try/except, and fall back to the raw
    string if decoding fails
    - Con: less consistent results than the option above (a field may
      come back as either unicode or a raw string)
    - Pro: old code works out of the box with mixed data
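
To make the difference concrete, here is a rough sketch of the two
behaviours on a single field value. The helper names (decode_explicit,
decode_with_fallback) and the sample maintainer name are made up for
illustration; this is not python-debian code, and the real change would
have to live in the Deb822 class.

```python
# Hypothetical helpers, NOT part of python-debian. They only illustrate
# the two proposed behaviours on one field value held as bytes.


def decode_explicit(value, encoding="utf-8", raw_strings=False):
    """Option 1: return raw bytes only when the caller asks for them."""
    if raw_strings:
        return value
    # Raises UnicodeDecodeError if the value is not valid in `encoding`.
    return value.decode(encoding)


def decode_with_fallback(value, encoding="utf-8"):
    """Option 2: try to decode, fall back to the raw bytes on failure."""
    try:
        return value.decode(encoding)
    except UnicodeDecodeError:
        return value


# The same (made-up) name in the two encodings that a mixed file contains.
utf8_name = "S\u00e9bastien".encode("utf-8")
latin1_name = "S\u00e9bastien".encode("latin-1")

for raw in (utf8_name, latin1_name):
    print(repr(decode_with_fallback(raw)))
```

With the first helper, the latin-1 value raises unless raw_strings=True
is passed; with the second, it silently comes back as bytes while the
utf-8 value comes back as text, which is the inconsistency noted above.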

Which one do you think makes more sense?

-- 
John Wright <j...@debian.org>


