On Fri, Dec 02, 2005 at 08:40:19PM +0100, Alain Bench wrote:
> >> remove LC_CTYPE, and set back LANG=zh_CN.GBK.
> > do you mean this:
> >| $ locale
> >| LANG=zh_CN.GBK
> >| LC_CTYPE=
> >| LC_ALL=
> 
>     Almost: Yet unset LC_CTYPE, or rather remove it from startup
> scripts. And verify LC_ALL is also really unset (and not set to empty
> string as was LC_CTYPE).

if I did this:
| [EMAIL PROTECTED]:~$ export LC_ALL=C
| [EMAIL PROTECTED]:~$ locale
| LANG=zh_CN.GBK
| LC_CTYPE="C"
| LC_NUMERIC="C"
| LC_TIME="C"
| LC_COLLATE="C"
| LC_MONETARY="C"
| LC_MESSAGES="C"
| LC_PAPER="C"
| LC_NAME="C"
| LC_ADDRESS="C"
| LC_TELEPHONE="C"
| LC_MEASUREMENT="C"
| LC_IDENTIFICATION="C"
| LC_ALL=C
then no segfault happened.
But if
| [EMAIL PROTECTED]:~$ export LC_ALL=
| [EMAIL PROTECTED]:~$ locale
| LANG=zh_CN.GBK
| LC_CTYPE=
| LC_NUMERIC="zh_CN.GBK"
| LC_TIME="zh_CN.GBK"
| LC_COLLATE="zh_CN.GBK"
| LC_MONETARY="zh_CN.GBK"
| LC_MESSAGES="zh_CN.GBK"
| LC_PAPER="zh_CN.GBK"
| LC_NAME="zh_CN.GBK"
| LC_ADDRESS="zh_CN.GBK"
| LC_TELEPHONE="zh_CN.GBK"
| LC_MEASUREMENT="zh_CN.GBK"
| LC_IDENTIFICATION="zh_CN.GBK"
| LC_ALL=
It will result to segfault.
Then
| [EMAIL PROTECTED]:~$ export LC_CTYPE=C
| [EMAIL PROTECTED]:~$ locale
| LANG=zh_CN.GBK
| LC_CTYPE=C
| LC_NUMERIC="zh_CN.GBK"
| LC_TIME="zh_CN.GBK"
| LC_COLLATE="zh_CN.GBK"
| LC_MONETARY="zh_CN.GBK"
| LC_MESSAGES="zh_CN.GBK"
| LC_PAPER="zh_CN.GBK"
| LC_NAME="zh_CN.GBK"
| LC_ADDRESS="zh_CN.GBK"
| LC_TELEPHONE="zh_CN.GBK"
| LC_MEASUREMENT="zh_CN.GBK"
| LC_IDENTIFICATION="zh_CN.GBK"
| LC_ALL=
It did not segfault, and print the iso char as:
| Rog\250\246rio Brito

Did this mean 
1) The library think this char is in GBK charset.
2) This char cannot be found in my GBK fonts.
or other explanation?

But I cannot use this setting for my usually usage, for It cannot
display all the Chinese Char correctly.

> 
> 
> >>| $ printf "\xE9" | grep "^Subject.*"
> > It print nothing by this line, nor by pring "\xE9"
> 
>     And what about:
> 
> | $ printf "Rog\xE9rio\n" | egrep "^R.*"
> | $ printf "Rog\xA8\xA6rio\n" | egrep "^R.*"

| [EMAIL PROTECTED]:~$ printf "Rog\xE9rio\n" | egrep "^R.*"
| Rog閞io
| [EMAIL PROTECTED]:~$ printf "Rog\xA8\xA6rio\n" | egrep "^R.*"
| Rogério
| [EMAIL PROTECTED]:~$

The first one print a Chinese char between g and i , while the second
one print an latin char with accent between g and r. Both are working
without segfault.

> >> have to be reassigned. ?Dato?
>                           ^
>     Note my previous mail was ISO-8859-1, because of this one character
> before "Dato". Character U+00BF not existing in GBK, therefore masked by
> a question mark for you on display, and quoted in your reply. In these
> conditions, it didn't segfault.

Sorry for the annoying. And hope we can solve it.

Regards,
Wang Xu
> 
> 
> Bye!  Alain.
> -- 
> Mutt muttrc tip to send mails in best adapted first necessary and sufficient
> charset (version for Western Latin-1/Latin-9/CP-850/CP-1252 terminal users):
> set send_charset="us-ascii:iso-8859-1:iso-8859-15:windows-1252:utf-8"


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to