On 6 sep, 15:25, [email protected] (Shawn H Corey) wrote: > On Mon, 2010-09-06 at 15:10 +0200, Pierre Nugues wrote: > > > I wrote a simple tokenizer for texts containing Latin9 characters. It > > does not behave as expected with the Swedish text below and I would > > like to find a workaround. > > Add these lines to top of your program: > > use strict; > use warnings; > > binmode STDIN, 'encoding(utf8)'; > binmode STDOUT, 'encoding(utf8)';
There is also utf8 in the perl sourcecode, therefore you should add use utf8; -- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected] http://learn.perl.org/
