Hello Unfortunately, licensecheck is still broken: the encoding detection based on `file` command does not work as expected.
For instance, in icedove. the file mozilla/js/src/vtune/jitprofiling.h contains windows-1252 character in a comment: $ file --mime-type --mime-encoding mozilla/js/src/vtune/jitprofiling.h mozilla/js/src/vtune/jitprofiling.h: text/plain; charset=unknown-8bit By default, licensecheck refuses to parse files with unknown encoding. I'm now convinced that using `file` to detect charset is more trouble than it's worth. Unless someone has a better idea, I'm going to revert most of the modification I've done with regard to encoding. licensecheck will go back to: - use a regexp to decide whether to parse a file or not - don't change encoding of copyright owners. The latter means that user will have to deal with any non utf8 character produced by licensecheck. All the best -- https://github.com/dod38fr/ -o- http://search.cpan.org/~ddumont/ http://ddumont.wordpress.com/ -o- irc: dod at irc.debian.org