[Rd] iconv documentation error

Therneau, Terry M., Ph.D. Fri, 31 Mar 2017 10:05:24 -0700

This caught us yesterday when a string that we assumed to be in UTF-8 was actually using CP1252. (This came from an internal web based service, so the root cause is not R's fault.) The help page for iconv states that the result of an invalid conversion is NA only when the toRaw argument is TRUE, but this appears to be true in general.


Example:

test1 <- "Ménière's disease" # the offending string (it was buried in a 13000 character result string)
test2 <- iconv(test1, to="CP1252") # create a version of the string that is in Windows-1252 coding

iconv(test2, from="UTF-8")          # reprise our error
[1] NA

Note that Encoding(test2) returns "latin1", which is also not quite in alignment with the help page.
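
One way to guard against this kind of silent NA is to check whether the input is valid UTF-8 before converting it. What follows is only a minimal sketch under stated assumptions: as_utf8() is a hypothetical helper, not part of base R, and the choice of CP1252 as the fallback encoding is an assumption that happened to fit our data. The test string is built from raw bytes so that the example does not depend on the session locale.

```r
# Hypothetical helper (not part of base R): return x as UTF-8, treating any
# element that is not valid UTF-8 as being in the `fallback` encoding.
# The CP1252 default is an assumption; adjust it to the actual data source.
as_utf8 <- function(x, fallback = "CP1252") {
  bad <- !validUTF8(x)                        # validUTF8() needs R >= 3.3.0
  x[bad] <- iconv(x[bad], from = fallback, to = "UTF-8")
  x
}

# "Ménière" as CP1252 bytes: 0xe9 is "é" and 0xe8 is "è" in that code page.
cp1252 <- rawToChar(as.raw(c(0x4d, 0xe9, 0x6e, 0x69, 0xe8, 0x72, 0x65)))

# Is the input valid UTF-8 at all?
validUTF8(cp1252)

# The conversion that tripped us up: invalid input, declared as UTF-8.
iconv(cp1252, from = "UTF-8", to = "UTF-8")

# With sub = "byte", iconv() substitutes the offending bytes in hex
# instead of giving up on the whole string.
iconv(cp1252, from = "UTF-8", to = "UTF-8", sub = "byte")

# Re-encode using the fallback encoding when the input is not valid UTF-8.
fixed <- as_utf8(cp1252)
validUTF8(fixed)
```

The sub argument is the cheaper option when the goal is only to locate the bad bytes in a long string; the validUTF8() check is more useful when the true source encoding is known and the text should be recovered.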


______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel
