On 6/12/2013 11:18 AM, bja...@jamesgang.dyndns.org wrote:
I've updated this code and, to make it easier to read, put it in a
GitHub repo: https://github.com/CyberCowboy/FindDuplicates

Everything is working; however, the code is hard to read, and I'll be
working on cleaning that up, as well as splitting the program into 3
different functions (one that gets hashes, one that finds and identifies
the duplicates, and one that outputs the results).

However, I'm having a problem: if a filename with non-ASCII characters is
encountered during the hashing phase, the program errors out on that file.
Since this is going to be used at work, and we have a large number of
Chinese and Arabic filenames, I need the search to handle Unicode filenames.
How would I go about doing this? Python 2.7, btw.

Feed os.walk a unicode path (for example u'.' rather than '.') and you'll
get unicode filenames back.
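
A minimal sketch of what that could look like in the hashing step. The
function name hash_tree, the choice of MD5, and the 64 KiB chunk size are
all assumptions for illustration and are not taken from the FindDuplicates
repo. On Python 2.7 the important part is that the root is unicode before
it reaches os.walk; on Python 3 ordinary strings are already Unicode.

```python
import hashlib
import io
import os
import sys


def hash_tree(root):
    """Return {file path: MD5 hex digest} for every file under root.

    hash_tree is a hypothetical helper, not code from the repo.
    """
    # On Python 2, bytes is str: a byte-string root would make os.walk
    # return byte-string names, so decode it to unicode first.
    if isinstance(root, bytes):
        root = root.decode(sys.getfilesystemencoding() or "utf-8")

    hashes = {}
    for dirpath, dirnames, filenames in os.walk(root):
        for name in filenames:
            # dirpath and name are unicode here, so the join is too.
            path = os.path.join(dirpath, name)
            digest = hashlib.md5()
            with io.open(path, "rb") as handle:
                # Read in chunks so large files are not loaded whole.
                for chunk in iter(lambda: handle.read(65536), b""):
                    digest.update(chunk)
            hashes[path] = digest.hexdigest()
    return hashes
```

Calling hash_tree(u'.') would hash everything under the current directory.
Keep the paths as unicode all the way through, and encode them only when
writing the results out (to a console or file that may not be Unicode).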

--
Sincerely,

Chris Calloway http://nccoos.org/Members/cbc
office: 3313 Venable Hall   phone: (919) 599-3530
mail: Campus Box #3300, UNC-CH, Chapel Hill, NC 27599
_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor