Package: whois
Version: 5.0.18
Severity: normal
Tags: patch

This was originally reported in Ubuntu here:

https://bugs.launchpad.net/ubuntu/+source/whois/+bug/1040586

The attached patch changes the expected encoding from koi8-u to utf-8.

I was unable to find a domain in .ua that had koi8-u encoded characters in
its response, but I found several which provided multi-byte UTF-8 chars:

kiev.ua
search.ua

In order to test, one can simply open a UTF-8 encoding terminal with
LANG set to a utf-8 language (such as en_US.UTF-8) and run 'whois
search.ua'. They will be garbled because they are being decoded as koi8-u
and then re-encoded as UTF-8.

LANG=C whois search.ua

The characters appear as proper multi-byte cyrillic characters.

Also the Ubuntu user who reported this has passed on a message from the
administrators that the encoding is utf-8.

-- System Information:
Debian Release: wheezy/sid
  APT prefers quantal-updates
  APT policy: (500, 'quantal-updates'), (500, 'quantal-security'), (500, 
'quantal'), (400, 'precise-proposed')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 3.5.0-10-generic (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages whois depends on:
ii  libc6     2.15-0ubuntu17
ii  libidn11  1.25-2

whois recommends no packages.

whois suggests no packages.

-- no debconf information
=== modified file 'servers_charset_list'
--- a/servers_charset_list	2012-07-22 20:35:18 +0000
+++ b/servers_charset_list	2012-09-04 23:15:12 +0000
@@ -48,6 +48,6 @@
 whois.nic.tr		utf-8
 whois.twnic.net.tw	utf-8
 whois.pp.ua		utf-8
-whois.ua		koi8-u
+whois.ua		utf-8
 whois.nic.org.uy	utf-8
 

Reply via email to