You can mark the extracted values as UTF-8 so that R displays them correctly:

value <- unlist(xpathApply(doc,"//MESSUNG/BEZEICHNUNG", xmlValue))
Encoding(value) <- "UTF-8"
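
A minimal sketch of what that assignment does, using base R only. The byte vector and the variable names are illustrative assumptions, not part of the original files. Encoding<- only declares how the existing bytes should be interpreted; re-encoding the bytes is a separate step done with iconv().

```r
# Bytes of "Länge" in UTF-8, built from raw values so the example does not
# depend on the encoding of this script or on the session locale.
utf8_bytes <- as.raw(c(0x4c, 0xc3, 0xa4, 0x6e, 0x67, 0x65))
value <- rawToChar(utf8_bytes)

# Declare the encoding; the underlying bytes are left unchanged.
Encoding(value) <- "UTF-8"
identical(charToRaw(value), utf8_bytes)  # TRUE
nchar(value, type = "chars")             # 5 characters
nchar(value, type = "bytes")             # 6 bytes

# An actual conversion rewrites the bytes, here to Latin-1.
latin1_value <- iconv(value, from = "UTF-8", to = "latin1")
charToRaw(latin1_value)                  # 4c e4 6e 67 65
```

If the strings need to be written out in the native Windows encoding, the iconv() step is probably the one to use; for display in the console, marking the encoding should be enough.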

On Thu, Sep 3, 2009 at 7:56 AM, Dominik Bänninger <bae...@kiecks.de> wrote:

> Dear list
> I tried to read an XML file using the XML package. Unfortunately, some
> encoding problems occur. E.g. German umlauts are not read correctly. I assume
> that this occurs due to (internal?) conversion to UTF-8. To illustrate the
> problem, I have written two XML files.
> File Test 1
> -----------
> <?xml version="1.0" encoding="ISO-8859-1"?>
> <Daten>
>  <ITEM>
>  <Messdaten>
>    <MESSUNG>
>    <BEZEICHNUNG>Länge</BEZEICHNUNG>
>   </MESSUNG>
>  </Messdaten>
>  </ITEM>
> </Daten>
>
> File Test 2
> -----------
> <?xml version="1.0" encoding="utf-8"?>
> <Daten>
>  <ITEM>
>  <Messdaten>
>    <MESSUNG>
>    <BEZEICHNUNG>Länge</BEZEICHNUNG>
>   </MESSUNG>
>  </Messdaten>
>  </ITEM>
> </Daten>
> The following code shows that I cannot extract the value of "Länge"
> correctly. Any help is very welcome.
> ----------------------------------------  Code Start ------------
> > fname1 <- "test1.xml"
> > fname2 <- "test2.xml"
> > doc <- xmlTreeParse(fname1, useInternalNodes=TRUE)
> > show(doc)
> <?xml version="1.0" encoding="ISO-8859-1"?>
> <Daten>
>  <ITEM>
>    <Messdaten>
>      <MESSUNG>
>        <BEZEICHNUNG>Länge</BEZEICHNUNG>
>      </MESSUNG>
>    </Messdaten>
>  </ITEM>
> </Daten>
> > xpathApply(doc,"//MESSUNG/BEZEICHNUNG", xmlValue)
> [[1]]
> [1] "Länge"
> > doc <- xmlTreeParse(fname2, useInternalNodes=TRUE)
> > show(doc)
> <?xml version="1.0" encoding="utf-8"?>
> <Daten>
>  <ITEM>
>    <Messdaten>
>      <MESSUNG>
>        <BEZEICHNUNG>Länge</BEZEICHNUNG>
>      </MESSUNG>
>    </Messdaten>
>  </ITEM>
> </Daten>
> > xpathApply(doc,"//MESSUNG/BEZEICHNUNG", xmlValue)
> [[1]]
> [1] "Länge"
> ----------------------------------------  Code Stop ------------
> I am using the following system.
> R version 2.9.1 (2008-06-26)
> Copyright (C) 2008 The R Foundation for Statistical Computing
> ISBN 3-900051-07-0
>
> Microsoft Windows XP
> XML_2.6-0.zip downloaded from
> http://www.stats.ox.ac.uk/pub/RWin/bin/windows/contrib/2.9/
>
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
>


-- 
Henrique Dallazuanna
Curitiba-Paraná-Brasil
25° 25' 40" S 49° 16' 22" O
