Followup-For: Bug #313411
Package: gawk

I tried to test this, and it seems that at this moment, this behavior is only seen when using the C locale. I got:

$ cat example.txt | LC_ALL=C awk '{ printf "%-5s%s\n",$1, $2 }'
A    Only_a_singlebyte_character_here_(UTF-8:_41)
Ö   A_letter_which_takes_two_bytes_(UTF-8:_c3_96)
€  A_currency_symbol_which_takes_three_bytes_(UTF-8:_e2_82_ac)

$ cat example.txt | LC_ALL=en_US.UTF-8 awk '{ printf "%-5s%s\n",$1, $2 }'
A    Only_a_singlebyte_character_here_(UTF-8:_41)
Ö    A_letter_which_takes_two_bytes_(UTF-8:_c3_96)
€    A_currency_symbol_which_takes_three_bytes_(UTF-8:_e2_82_ac)

It therefore seems to me that the current behavior is valid, and to be expected.

Regards,
Rogier.

-- System Information:
Debian Release: wheezy/sid
  APT prefers squeeze-updates
  APT policy: (500, 'squeeze-updates'), (500, 'testing')
Architecture: i386 (x86_64)

Kernel: Linux 2.6.32-5-amd64 (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages gawk depends on:
ii  libc6                         2.11.2-11  Embedded GNU C Library: Shared lib

gawk recommends no packages.

gawk suggests no packages.

-- no debconf information
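
For reference, the difference between the two runs comes down to whether the field width in "%-5s" is counted in bytes or in characters. Below is a minimal Python sketch of that arithmetic. The helper name pad_left_justified and its count_bytes flag are hypothetical, chosen only for illustration; this models the padding, it is not how gawk implements printf.

```python
def pad_left_justified(text: str, width: int, count_bytes: bool) -> str:
    """Left-justify `text` in a field of `width`, in the spirit of printf's %-5s.

    Illustrative model only (an assumption, not gawk's code):
    count_bytes=True  measures the field in UTF-8 bytes, as a byte-oriented
                      locale such as C would;
    count_bytes=False measures it in characters, as a UTF-8 locale would.
    """
    length = len(text.encode("utf-8")) if count_bytes else len(text)
    # Never truncate: a string longer than the field is emitted whole.
    return text + " " * max(0, width - length)


# The three first fields from example.txt: 1, 2 and 3 bytes in UTF-8.
SAMPLES = ["A", "Ö", "€"]


def padded_fields(count_bytes: bool) -> list:
    """Return each sample padded to width 5, with '|' marking the field end."""
    return [pad_left_justified(s, 5, count_bytes) + "|" for s in SAMPLES]


if __name__ == "__main__":
    for label, flag in (("width counted in bytes", True),
                        ("width counted in characters", False)):
        print(label)
        for line in padded_fields(flag):
            print(line)
```

When the width is counted in bytes, a two-byte character receives one space less and a three-byte character two spaces less, so the second column drifts left on a UTF-8 terminal. When it is counted in characters, every field receives the same padding.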