[OT] Re: accents

2011-10-02 Thread Stephane CHAZELAS
2011-08-25, 12:19(-07), Linda Walsh: [...] > ` Greg Wooledge wrote: >> On Wed, Aug 24, 2011 at 06:51:32PM -0700, Linda Walsh wrote: >> >>> BTW, Thomas -- what is the Character that comes after 'De' in your >>> name? I read it as hex '0xc282c2' which doesn't seem to be valid unicode. >>> >

Re: accents

2011-09-12 Thread Chet Ramey
On 9/12/11 9:22 AM, Thomas De Contes wrote: > on the other hand, if there is a way for bash to not display special > characters on the interactive line, > i suppose it can be useful to know how to do that in many cases, for example > if i switch on a new platform, waiting for making special char

Re: accents

2011-09-12 Thread Thomas De Contes
Le 25 août 2011 à 15:35, Chet Ramey a écrit : >> >> Le 25 août 2011 à 14:17, Chet Ramey a écrit : >> so, wcwidth is apple's responsibility, then no libre-software developer can do anything, Am I right ? >>> >>> To a certain extent, yes. wcwidth is a libc function, and it has a

Re: accents

2011-08-25 Thread Linda Walsh
` Greg Wooledge wrote: On Wed, Aug 24, 2011 at 06:51:32PM -0700, Linda Walsh wrote: BTW, Thomas -- what is the Character that comes after 'De' in your name? I read it as hex '0xc282c2' which doesn't seem to be valid unicode. RFC 2822 (section 2.2) says that Header Fields in an ema

Re: accents

2011-08-25 Thread Chet Ramey
> > Le 25 août 2011 à 14:17, Chet Ramey a écrit : > > >> so, wcwidth is apple's responsibility, then no libre-software developer > >> can do anything, > >> Am I right ? > > > > To a certain extent, yes. wcwidth is a libc function, and it has a bug > > on Mac OS X. > > Well, libc is a part of

Re: accents

2011-08-25 Thread Greg Wooledge
On Thu, Aug 25, 2011 at 03:09:04PM +0200, Thomas De Contes wrote: > Well, libc is a part of linux, isn't it ? libc is the generic name for an implementation of the library of functions required by various standards, including ISO/ANSI C, POSIX, and so on. It's usually named libc* and located in /l

Re: accents

2011-08-25 Thread Thomas De Contes
Le 25 août 2011 à 14:17, Chet Ramey a écrit : >> so, wcwidth is apple's responsibility, then no libre-software developer can >> do anything, >> Am I right ? > > To a certain extent, yes. wcwidth is a libc function, and it has a bug > on Mac OS X. Well, libc is a part of linux, isn't it ? Does

Re: accents

2011-08-25 Thread Thomas De Contes
Le 25 août 2011 à 14:36, Greg Wooledge a écrit : > On Wed, Aug 24, 2011 at 06:51:32PM -0700, Linda Walsh wrote: >> BTW, Thomas -- what is the Character that comes after 'De' in your >> name? I read it as hex '0xc282c2' which doesn't seem to be valid unicode. (it is NBSP (for address book)) >

Re: accents

2011-08-25 Thread Greg Wooledge
On Wed, Aug 24, 2011 at 06:51:32PM -0700, Linda Walsh wrote: > BTW, Thomas -- what is the Character that comes after 'De' in your > name? I read it as hex '0xc282c2' which doesn't seem to be valid unicode. RFC 2822 (section 2.2) says that Header Fields in an email must be composed of US-ASCII ch

Re: accents

2011-08-25 Thread Chet Ramey
> so, wcwidth is apple's responsibility, then no libre-software developer can > do anything, > Am I right ? To a certain extent, yes. wcwidth is a libc function, and it has a bug on Mac OS X. I will have a workaround in the next readline release. Chet -- ``The lyf so short, the craft so long

Re: accents

2011-08-24 Thread Linda Walsh
Chet Ramey wrote: In yours, however, it is 0x65 0xcc 0x81 which is U+0065 LATIN SMALL LETTER E followed by U+0301 COMBINING ACUTE ACCENT. That's not valid UTF-8, since UTF-8 requires that the shortest sequence be used to encode a character. This is exactly true... Valid UTF-8 is anythi

Re: accents

2011-08-24 Thread Thomas De Contes
Le 16 mai 2011 à 17:02, Chet Ramey a écrit : > On 5/9/11 10:46 AM, Thomas De Contes wrote: >> 1 >> - execute >> PS1="&# $PS1" >> - drag & drop the file with the accent >> - use "top arrow" and "bottom arrow" to move in the history : >> at each time you move on the line containing an accent, it ea

Re: accents

2011-05-16 Thread Chet Ramey
On 5/9/11 10:46 AM, Thomas De Contes wrote: > Configuration Information [Automatically generated, do not change]: > Machine: i386 > OS: darwin10.7.0 > Compiler: /usr/bin/gcc-4.2 > Compilation CFLAGS: -DPROGRAM='bash' -DCONF_HOSTTYPE='i386' > -DCONF_OSTYPE='darwin10.7.0' -DCONF_MACHTYPE='i386-appl

Re: accents

2011-05-16 Thread Andreas Schwab
Chet Ramey writes: > That's a non sequitor. My point is that, as I read it, UTF-8 requires the > use of the shortest sequence that can represent a particular character. The character is U+0301, and the shorted sequence is 0xcc 0x81. > In this case, that means that U+00E9 must be used to repres

Re: accents

2011-05-15 Thread Chet Ramey
On 5/15/11 6:16 PM, Chet Ramey wrote: > The other difference is that drag-and-drop on Mac OS X (at least dropping > from the finder) produces full pathnames. I was able to reproduce display > problems (which I haven't yet investigated) using that, but not using > tab completion in the way you did

Re: accents

2011-05-15 Thread Chet Ramey
On 5/15/11 6:38 PM, Andreas Schwab wrote: > Chet Ramey writes: > >> On 5/10/11 9:17 AM, Greg Wooledge wrote: >> >>> In yours, however, it is 0x65 0xcc 0x81 which is U+0065 LATIN SMALL >>> LETTER E followed by U+0301 COMBINING ACUTE ACCENT. >> >> That's not valid UTF-8, since UTF-8 requires that t

Re: accents

2011-05-15 Thread Andreas Schwab
Chet Ramey writes: > On 5/10/11 9:17 AM, Greg Wooledge wrote: > >> In yours, however, it is 0x65 0xcc 0x81 which is U+0065 LATIN SMALL >> LETTER E followed by U+0301 COMBINING ACUTE ACCENT. > > That's not valid UTF-8, since UTF-8 requires that the shortest sequence > be used to encode a character

Re: accents

2011-05-15 Thread Chet Ramey
On 5/10/11 9:17 AM, Greg Wooledge wrote: >>> Is the accented character >>> a single-byte character, or a multi-byte character, in your locale? >> >> a multi-byte character, i think >> How to confirm that ? (Keep in mind as you read my answers that I know very little more than anyone else about Un

Re: accents

2011-05-10 Thread Chet Ramey
On 5/10/11 9:17 AM, Greg Wooledge wrote: > In yours, however, it is 0x65 0xcc 0x81 which is U+0065 LATIN SMALL > LETTER E followed by U+0301 COMBINING ACUTE ACCENT. > > Perhaps Bash does not know how to interpret COMBINING ACUTE ACCENT which > follows a letter...? > > I'm not intimately familiar

Re: accents

2011-05-10 Thread Andreas Schwab
Perhaps the wcwidth function is broken on MacOS. Andreas. -- Andreas Schwab, sch...@linux-m68k.org GPG Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different."

Re: accents

2011-05-10 Thread Greg Wooledge
On Tue, May 10, 2011 at 04:47:29AM +0200, Thomas De Contes wrote: > tDeContes-fixe:~ thomas$ echo "$PS1" > + echo '\h:\W \u\$ ' > \h:\W \u\$ > if i do not > PS1="&# $PS1" > then i don't have the problem described in 1 I am not able to reproduce this in my environment. I'm using Debian 6.0 on i3

Re: accents

2011-05-09 Thread Thomas De Contes
Le 9 mai 2011 à 20:21, Greg Wooledge a écrit : > On Mon, May 09, 2011 at 04:46:14PM +0200, Thomas De Contes wrote: >> Description: >> >> 1 >> when i do >> PS1="&# $PS1" >> then I have problems since there is some accents in my command lines : > > What is the value of PS1 before you prepend ampe

Re: accents

2011-05-09 Thread Greg Wooledge
On Mon, May 09, 2011 at 04:46:14PM +0200, Thomas De Contes wrote: > Description: > > 1 > when i do > PS1="&# $PS1" > then I have problems since there is some accents in my command lines : What is the value of PS1 before you prepend ampersand-hash-space to it? What does the ampersand-hash-space h