On Wed, Jul 13, 2016 at 06:11:44PM +0200, Adam Borowski wrote: > "unicode -r ' S$'" doesn't return any output, despite the regexp being > expected to match: > U+0053 LATIN CAPITAL LETTER S > U+0073 LATIN SMALL LETTER S > U+00DF LATIN SMALL LETTER SHARP S > and many others.
It's clear why it is happening - the matching works on /usr/share/uni{code,data}/UnicodeData.txt, where the character name ends with a semicolon. It would be rather easy to fix, at the price of slowing down the search... so I am a bit at a loss what should be the best approach. best, -- ----------------------------------------------------------- | Radovan GarabĂk http://kassiopeia.juls.savba.sk/~garabik/ | | __..--^^^--..__ garabik @ kassiopeia.juls.savba.sk | ----------------------------------------------------------- Antivirus alert: file .signature infected by signature virus. Hi! I'm a signature virus! Copy me into your signature file to help me spread!
signature.asc
Description: PGP signature