[Bug 821951] Re: sort -u erase some utf8 characters

2011-11-16 Thread Ubuntu Foundation's Bug Bot
The attachment "iso14651_t1.diff" of this bug report has been identified as being a patch. The ubuntu-reviewers team has been subscribed to the bug report so that they can review the patch. In the event that this is in fact not a patch you can resolve this situation by removing the tag 'patch' fr

[Bug 821951] Re: sort -u erase some utf8 characters

2011-11-16 Thread Martin Pitt
** Changed in: langpack-locales (Ubuntu) Importance: Undecided => High ** Changed in: langpack-locales (Ubuntu) Status: Confirmed => Triaged -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/821

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-07 Thread An Yang
This patch can fix this bug, when sort -u was executed in any LANG except for zh_CN. ** Patch added: "iso14651_t1.diff" https://bugs.launchpad.net/ubuntu/+source/langpack-locales/+bug/821951/+attachment/2260546/+files/iso14651_t1.diff -- You received this bug notification because you are a m

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-07 Thread An Yang
Something is wrong in iso14651_t1_pinyin and iso14651_t1 ** Package changed: eglibc (Ubuntu) => langpack-locales (Ubuntu) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/821951 Title: sort -u erase s

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread Bug Watch Updater
Launchpad has imported 2 comments from the remote bug at http://sourceware.org/bugzilla/show_bug.cgi?id=13063. If you reply to an imported comment from within Launchpad, your comment will be sent to the remote bug automatically. Read more about Launchpad's inter-bugtracker facilities at https://he

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
** Bug watch added: Sourceware.org Bugzilla #13063 http://sourceware.org/bugzilla/show_bug.cgi?id=13063 ** Also affects: eglibc via http://sourceware.org/bugzilla/show_bug.cgi?id=13063 Importance: Unknown Status: Unknown -- You received this bug notification because you are a mem

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
All of the lost 686 Chinese characters locate in CJK UNIFIED IDEOGRAPH EXTENSION A block. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/821951 Title: sort -u erase some utf8 characters To manage no

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
Sorry, lost a word. EGlibc/glibc lack support of CJK UNIFIED IDEOGRAPH EXTENSION A/B/C/D defined in iso10646:2011. CJK UNIFIED IDEOGRAPH EXTENSION A is included in GB18030:2005, and GB18030:2005 is the China locale standard. ** Package changed: coreutils (Ubuntu) => eglibc (Ubuntu) ** Changed i

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
The reason is eglibc/glibc just supports CJK UNIFIED IDEOGRAPH (- ) defined in iso10646:1993. EGlibc/glibc lack support of CJK UNIFIED IDEOGRAPH A/B/C/D defined in iso10646:2011. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
If I set LANG to en_US.utf8, sort -u erase 2716 chinese characters. See attachment please. ** Patch added: "x.diff" https://bugs.launchpad.net/ubuntu/+source/coreutils/+bug/821951/+attachment/2258244/+files/x.diff -- You received this bug notification because you are a member of Ubuntu Bugs,

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
My locale is: LANG=zh_CN.utf8 LANGUAGE=zh_CN:zh LC_CTYPE="zh_CN.utf8" LC_NUMERIC="zh_CN.utf8" LC_TIME="zh_CN.utf8" LC_COLLATE="zh_CN.utf8" LC_MONETARY="zh_CN.utf8" LC_MESSAGES="zh_CN.utf8" LC_PAPER="zh_CN.utf8" LC_NAME="zh_CN.utf8" LC_ADDRESS="zh_CN.utf8" LC_TELEPHONE="zh_CN.utf8" LC_MEASUREMENT="

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
my result of sort -u x.sorted.utf8 > x.sorted.uniq.utf8 I do this in lucid and natty, got the same problem. ** Attachment added: "x.sorted.uniq.utf8" https://bugs.launchpad.net/ubuntu/+source/coreutils/+bug/821951/+attachment/2258240/+files/x.sorted.uniq.utf8 -- You received this bug notifi

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
my x.diff file, sort -u erase 686 chinese characters. ** Patch added: "x.diff" https://bugs.launchpad.net/ubuntu/+source/coreutils/+bug/821951/+attachment/2258242/+files/x.diff -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https

[Bug 821951] Re: sort -u erase some utf8 characters

2011-08-06 Thread An Yang
** Attachment added: "orig source file" https://bugs.launchpad.net/bugs/821951/+attachment/2258214/+files/x.sorted.utf8 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/821951 Title: sort -u erase