Cannot reproduce this bug with the given sample (file_5.08-1). Was the file originally in an encoding other than UTF-8?

$ file hungarian.txt
hungarian.txt: UTF-8 Unicode text
$ sha1sum hungarian.txt
3d1eba4eda2e8596f20f7321b3c36f9e22c18bca  hungarian.txt