On Tue, 2008-02-19 at 22:55 -0800, Matthew Mueller wrote:
> On Tue, Feb 19, 2008 at 09:31:41PM +0100, Alain Kalker wrote:
> > When verifying .md5 or .cfv files which contain filenames containing 
> > chars with values > 127 (in ISO-8859), cfv reports the corresponding 
> > files as missing.
> > I don't know if this problem is related to bug #406761, but that bug 
> > suggests a specific problem with the encoding of UTF-8 characters in 
> > .torrent files. If these problems are related, please feel free to merge 
> > the bugs.
> 
> It's similar to that bug, but not the same cause.  In this case, cfv
> (1.x) has no understanding of encodings for text checksum files, and
> since the files are encoded in iso-8859-? but your system is probably
> using utf-8 for filenames, the bytes just aren't going to match up.  For
> the text-based formats you can work around this pretty easily by
> manually converting the files:
> "iconv -f iso-8859-1 -t utf-8 foo.md5 > foo.utf8.md5"
> 
> Therefore, I say that this isn't really a bug at all.  cfv 1.x certainly
> can verify files with chars > 127 as long as the encoding matches the
> one used by your system.
> 
> If you want, you can consider it as a feature request for built-in
> encoding handling.  Actually, the cfv 2.x devel code already has
> encoding knowledge so you could just do:
> "cfv --encoding=iso-8859-1 -f foo.md5"
> However I'm (slowly) doing some major refactoring on it and it's not
> ready for release yet :(

Thanks for the very quick reply and helpful workaround! Yes, my system
is setup to use UTF-8, and the checksum files I used use one of the
iso-8859 encodings.

Perhaps it would be a good idea to mention your workaround in
README.Debian, so others faced with this problem don't have to check
bug reports for the solution.

Please consider the encoding issue a feature request, and good luck
with your rewrite.

Kind regards,
Alain



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to