On Sun, Jan 23, 2005 at 08:40:48PM +0000, Ed Avis <[EMAIL PROTECTED]> wrote: > On Sun, 9 Jan 2005, Kenneth Pronovici wrote: > > >> <title lang="de">Das Geheimnis der Südseeperlen</title> > >> > >>Output: > >> > >> <title lang="de">Das Geheimnis der Südseeperlen</title> > > Is the XML prolog (or whatever it's called) the same for both the > input and output files?
The XML prolog says "iso-8859-1" for all files in question. (I wasn't aware of that before, and it surprises me, because xml parsers are not required to parse such documents, so that is at least a bit of a interoperability problem). > Are you sure the input isn't ISO-8859-1 and > the output UTF-8? I am sure that this *is* the case. That's the problem, in fact. Prolog says latin1 while output is utf-8. > Does it also happen with tv_cat? Yes, until I added "binmode STDIN; binmode STDOUT". It's likely that, on output, XMLTV doesn't use binmode and so is sensitive to the environment, which, in my case, prefers UTF-8 output, so perl will likely output utf-8, while xmltv outputs an encoding=iso-8859-1. -- The choice of a -----==- _GNU_ ----==-- _ generation Marc Lehmann ---==---(_)__ __ ____ __ [EMAIL PROTECTED] --==---/ / _ \/ // /\ \/ / http://schmorp.de/ -=====/_/_//_/\_,_/ /_/\_\ XX11-RIPE -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]