Package: whois Version: 5.0.18 Severity: normal Tags: patch This was originally reported in Ubuntu here:
https://bugs.launchpad.net/ubuntu/+source/whois/+bug/1040586 The attached patch changes the expected encoding from koi8-u to utf-8. I was unable to find a domain in .ua that had koi8-u encoded characters in its response, but I found several which provided multi-byte UTF-8 chars: kiev.ua search.ua In order to test, one can simply open a UTF-8 encoding terminal with LANG set to a utf-8 language (such as en_US.UTF-8) and run 'whois search.ua'. They will be garbled because they are being decoded as koi8-u and then re-encoded as UTF-8. LANG=C whois search.ua The characters appear as proper multi-byte cyrillic characters. Also the Ubuntu user who reported this has passed on a message from the administrators that the encoding is utf-8. -- System Information: Debian Release: wheezy/sid APT prefers quantal-updates APT policy: (500, 'quantal-updates'), (500, 'quantal-security'), (500, 'quantal'), (400, 'precise-proposed') Architecture: amd64 (x86_64) Foreign Architectures: i386 Kernel: Linux 3.5.0-10-generic (SMP w/1 CPU core) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash Versions of packages whois depends on: ii libc6 2.15-0ubuntu17 ii libidn11 1.25-2 whois recommends no packages. whois suggests no packages. -- no debconf information
=== modified file 'servers_charset_list' --- a/servers_charset_list 2012-07-22 20:35:18 +0000 +++ b/servers_charset_list 2012-09-04 23:15:12 +0000 @@ -48,6 +48,6 @@ whois.nic.tr utf-8 whois.twnic.net.tw utf-8 whois.pp.ua utf-8 -whois.ua koi8-u +whois.ua utf-8 whois.nic.org.uy utf-8