On Wed, Jul 13, 2016 at 06:11:44PM +0200, Adam Borowski wrote:
> "unicode -r ' S$'" doesn't return any output, despite the regexp being
> expected to match:
> U+0053 LATIN CAPITAL LETTER S
> U+0073 LATIN SMALL LETTER S
> U+00DF LATIN SMALL LETTER SHARP S
> and many others.

It's clear why it is happening - the matching works on
/usr/share/uni{code,data}/UnicodeData.txt, where the character name ends
with a semicolon.
It would be rather easy to fix, at the price of slowing down the
search... so I am a bit at a loss what should be the best approach.

best,
-- 
 -----------------------------------------------------------
| Radovan GarabĂ­k http://kassiopeia.juls.savba.sk/~garabik/ |
| __..--^^^--..__    garabik @ kassiopeia.juls.savba.sk     |
 -----------------------------------------------------------
Antivirus alert: file .signature infected by signature virus.
Hi! I'm a signature virus! Copy me into your signature file to help me spread!

Attachment: signature.asc
Description: PGP signature

Reply via email to