On Fri, Dec 02, 2005 at 08:40:19PM +0100, Alain Bench wrote: > >> remove LC_CTYPE, and set back LANG=zh_CN.GBK. > > do you mean this: > >| $ locale > >| LANG=zh_CN.GBK > >| LC_CTYPE= > >| LC_ALL= > > Almost: Yet unset LC_CTYPE, or rather remove it from startup > scripts. And verify LC_ALL is also really unset (and not set to empty > string as was LC_CTYPE).
if I did this: | [EMAIL PROTECTED]:~$ export LC_ALL=C | [EMAIL PROTECTED]:~$ locale | LANG=zh_CN.GBK | LC_CTYPE="C" | LC_NUMERIC="C" | LC_TIME="C" | LC_COLLATE="C" | LC_MONETARY="C" | LC_MESSAGES="C" | LC_PAPER="C" | LC_NAME="C" | LC_ADDRESS="C" | LC_TELEPHONE="C" | LC_MEASUREMENT="C" | LC_IDENTIFICATION="C" | LC_ALL=C then no segfault happened. But if | [EMAIL PROTECTED]:~$ export LC_ALL= | [EMAIL PROTECTED]:~$ locale | LANG=zh_CN.GBK | LC_CTYPE= | LC_NUMERIC="zh_CN.GBK" | LC_TIME="zh_CN.GBK" | LC_COLLATE="zh_CN.GBK" | LC_MONETARY="zh_CN.GBK" | LC_MESSAGES="zh_CN.GBK" | LC_PAPER="zh_CN.GBK" | LC_NAME="zh_CN.GBK" | LC_ADDRESS="zh_CN.GBK" | LC_TELEPHONE="zh_CN.GBK" | LC_MEASUREMENT="zh_CN.GBK" | LC_IDENTIFICATION="zh_CN.GBK" | LC_ALL= It will result to segfault. Then | [EMAIL PROTECTED]:~$ export LC_CTYPE=C | [EMAIL PROTECTED]:~$ locale | LANG=zh_CN.GBK | LC_CTYPE=C | LC_NUMERIC="zh_CN.GBK" | LC_TIME="zh_CN.GBK" | LC_COLLATE="zh_CN.GBK" | LC_MONETARY="zh_CN.GBK" | LC_MESSAGES="zh_CN.GBK" | LC_PAPER="zh_CN.GBK" | LC_NAME="zh_CN.GBK" | LC_ADDRESS="zh_CN.GBK" | LC_TELEPHONE="zh_CN.GBK" | LC_MEASUREMENT="zh_CN.GBK" | LC_IDENTIFICATION="zh_CN.GBK" | LC_ALL= It did not segfault, and print the iso char as: | Rog\250\246rio Brito Did this mean 1) The library think this char is in GBK charset. 2) This char cannot be found in my GBK fonts. or other explanation? But I cannot use this setting for my usually usage, for It cannot display all the Chinese Char correctly. > > > >>| $ printf "\xE9" | grep "^Subject.*" > > It print nothing by this line, nor by pring "\xE9" > > And what about: > > | $ printf "Rog\xE9rio\n" | egrep "^R.*" > | $ printf "Rog\xA8\xA6rio\n" | egrep "^R.*" | [EMAIL PROTECTED]:~$ printf "Rog\xE9rio\n" | egrep "^R.*" | Rog閞io | [EMAIL PROTECTED]:~$ printf "Rog\xA8\xA6rio\n" | egrep "^R.*" | Rogério | [EMAIL PROTECTED]:~$ The first one print a Chinese char between g and i , while the second one print an latin char with accent between g and r. Both are working without segfault. > >> have to be reassigned. ?Dato? > ^ > Note my previous mail was ISO-8859-1, because of this one character > before "Dato". Character U+00BF not existing in GBK, therefore masked by > a question mark for you on display, and quoted in your reply. In these > conditions, it didn't segfault. Sorry for the annoying. And hope we can solve it. Regards, Wang Xu > > > Bye! Alain. > -- > Mutt muttrc tip to send mails in best adapted first necessary and sufficient > charset (version for Western Latin-1/Latin-9/CP-850/CP-1252 terminal users): > set send_charset="us-ascii:iso-8859-1:iso-8859-15:windows-1252:utf-8" -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]