Package: coreutils
Version: 8.21-1
Severity: important

The fold command does not handle UTF-8 encoded input correctly. I have a
file that contains:

cat test.rst 
123456789 123456789 123456789
ľščťžýáíé ľščťžýáíé ľščťžýáíé

123456789 123456789 123456789 123456789
ľščťžýáíé ľščťžýáíé ľščťžýáíé ľščťžýáíé

Applying fold to limit the line length to 30 characters gives this
result:

fold -w 30 test.rst 
123456789 123456789 123456789
ľščťžýáíé ľščťž
�áíé ľščťžýáíé

123456789 123456789 123456789 
123456789
ľščťžýáíé ľščťž
�áíé ľščťžýáíé ľ
�čťžýáíé
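
The split points above are consistent with the width being counted in
bytes rather than characters. A minimal Python sketch, assuming that fold
simply cuts every 30 bytes (this is inferred from the output, not checked
against the coreutils source; fold_bytes is a hypothetical helper):

```python
# Assumption: one column per byte, so the line is cut every `width` bytes.
def fold_bytes(line, width):
    data = line.encode("utf-8")
    return [data[i:i + width] for i in range(0, len(data), width)]

line = "ľščťžýáíé ľščťžýáíé ľščťžýáíé"
# Characters vs. bytes: each accented letter here takes two bytes in UTF-8.
print(len(line), len(line.encode("utf-8")))
for chunk in fold_bytes(line, 30):
    # errors="replace" renders each half of a split character as U+FFFD,
    # similar to what the terminal shows.
    print(chunk.decode("utf-8", errors="replace"))
```

The 30th byte is the first half of "ý", so the cut lands inside that
character.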

As you can see, some UTF-8 characters are broken (two-byte characters are
split in the middle). When breaking at spaces:

fold -sw 30 test.rst 
123456789 123456789 123456789
ľščťžýáíé 
ľščťžýáíé 
ľščťžýáíé

123456789 123456789 123456789 
123456789
ľščťžýáíé 
ľščťžýáíé 
ľščťžýáíé 
ľščťžýáíé

The characters are not broken, but the file is not wrapped at 30
characters. I expect the file to be wrapped like this:

123456789 123456789 123456789
ľščťžýáíé ľščťžýáíé ľščťžýáíé

123456789 123456789 123456789 
123456789
ľščťžýáíé ľščťžýáíé ľščťžýáíé 
ľščťžýáíé
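
The expected wrapping can be sketched in Python by counting decoded
characters instead of bytes. This is only an illustration of the behaviour
I expect, not a proposed patch: fold_chars is a hypothetical helper, and
it counts code points, which ignores tabs, combining characters and
double-width characters.

```python
def fold_chars(line, width, break_at_spaces=False):
    """Fold one line (without its newline) at `width` characters."""
    rows = []
    while len(line) > width:
        cut = width
        if break_at_spaces:
            # Last space within the first `width` characters, if any.
            space = line.rfind(" ", 0, width)
            if space != -1:
                cut = space + 1  # the space stays at the end of the row
        rows.append(line[:cut])
        line = line[cut:]
    rows.append(line)
    return rows

for text in ("ľščťžýáíé ľščťžýáíé ľščťžýáíé",
             "ľščťžýáíé ľščťžýáíé ľščťžýáíé ľščťžýáíé"):
    for row in fold_chars(text, 30, break_at_spaces=True):
        print(row)
```

With a width of 30 the first line (29 characters) is left alone and the
second is cut after its third space.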

regards

--- System information. ---
Architecture: amd64
Kernel:       Linux 3.10-2-amd64

Debian Release: jessie/sid
  500 testing         security.debian.org 
  500 testing         ftp.cz.debian.org 
  500 stable          deb.opera.com 
  200 testing         www.deb-multimedia.org 

I set the severity to important because UTF-8 is nowadays Debian's
default terminal encoding.

regards

-- 
Slavko
http://slavino.sk

