On 6 sep, 15:25, [email protected] (Shawn H Corey) wrote:
> On Mon, 2010-09-06 at 15:10 +0200, Pierre Nugues wrote:
>
> > I wrote a simple tokenizer for texts containing Latin9 characters. It
> > does not behave as expected with the Swedish text below and I would
> > like to find a workaround.
>
> Add these lines to top of your program:
>
> use strict;
> use warnings;
>
> binmode STDIN, 'encoding(utf8)';
> binmode STDOUT, 'encoding(utf8)';

The Perl source code itself also contains UTF-8 characters, so you should
also add:

use utf8;
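
A minimal sketch of why this matters, assuming the script file itself is
saved as UTF-8. The sample strings are made up for illustration and are not
from the original program:

```perl
use strict;
use warnings;
use utf8;                             # literals in this file are UTF-8 text

binmode STDOUT, ':encoding(UTF-8)';   # encode characters on output

# Hypothetical sample word, not taken from the original post.
my $word = 'smörgås';

# With "use utf8" the literal holds 7 characters. Without it, Perl would
# see the raw bytes instead (ö and å take two bytes each in UTF-8).
print length($word), "\n";

# \w matches the Swedish letters because the string holds characters,
# not bytes, so each word stays in one piece.
my @tokens = 'en smörgås på bordet' =~ /(\w+)/g;
print scalar(@tokens), "\n";
```

Note that "use utf8" only affects how the source file is read; input and
output handles still need their own binmode or open layers.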



