You can convert back to UTF-8: value <- unlist(xpathApply(doc,"//MESSUNG/BEZEICHNUNG", xmlValue)) Encoding(value) <- "UTF-8"
On Thu, Sep 3, 2009 at 7:56 AM, Dominik Bänninger <bae...@kiecks.de> wrote: > Dear list > I tried to read an xml file using the xml package. Unfortunately, some > encoding problems occure. E.g. german Umlaut will be red correctly. I assume > that the occurs due to (internal?) conversion to utf-8. To illustrate the > problem, I have wrote to xml files. > File Test 1 > ----------- > <?xml version="1.0" encoding="ISO-8859-1"?> > <Daten> > <ITEM> > <Messdaten> > <MESSUNG> > <BEZEICHNUNG>Länge</BEZEICHNUNG> > </MESSUNG> > </Messdaten> > </ITEM> > </Daten> > > File Test 2 > ----------- > <?xml version="1.0" encoding="utf-8"?> > <Daten> > <ITEM> > <Messdaten> > <MESSUNG> > <BEZEICHNUNG>Länge</BEZEICHNUNG> > </MESSUNG> > </Messdaten> > </ITEM> > </Daten> > The following code shows that i cannot extract the Value of "Länge" > correctly. Any help is very welcom. > ---------------------------------------- Code Start ------------ > > fname1 <- "test1.xml" > > fname2 <- "test2.xml" > > doc <- xmlTreeParse(fname1,useInternalNodes=T) > > show(doc) > <?xml version="1.0" encoding="ISO-8859-1"?> > <Daten> > <ITEM> > <Messdaten> > <MESSUNG> > <BEZEICHNUNG>Länge</BEZEICHNUNG> > </MESSUNG> > </Messdaten> > </ITEM> > </Daten> > > xpathApply(doc,"//MESSUNG/BEZEICHNUNG", xmlValue) > [[1]] > [1] "Länge" > > doc <- xmlTreeParse(fname2,useInternalNodes=T) > > show(doc) > <?xml version="1.0" encoding="utf-8"?> > <Daten> > <ITEM> > <Messdaten> > <MESSUNG> > <BEZEICHNUNG>Länge</BEZEICHNUNG> > </MESSUNG> > </Messdaten> > </ITEM> > </Daten> > > xpathApply(doc,"//MESSUNG/BEZEICHNUNG", xmlValue) > [[1]] > [1] "Länge" > ---------------------------------------- Code Stop ------------ > I am using the following system. > R version 2.9.1 (2008-06-26) > Copyright (C) 2008 The R Foundation for Statistical Computing > ISBN 3-900051-07-0 > > Microsoft Windows XP > XML_2.6-0.zip downloaded from > http://www.stats.ox.ac.uk/pub/RWin/bin/windows/contrib/2.9/ > > [[alternative(swapped) HTML version deleted]] > > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > > -- Henrique Dallazuanna Curitiba-Paraná-Brasil 25° 25' 40" S 49° 16' 22" O [[alternative HTML version deleted]]
______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.