On 13/05/2011 4:57 PM, lyolya wrote:
Hello,

I am experiencing a problem in reading a database in Russian. The problem
appears when it comes to char variables. I have already tried changing the
encoding, i.e.

options(encoding="UTF-8")

and

options(encoding="KOI8-R")

but every time something unreadable appears in the data frame,
like \x82\xa2\xae\xef etc.

Could you please answer whether it is possible to operate with Russian
strings in R, and, if yes, how to get to do that. Thank you, in advance.

Yes, it is possible. You can test it using a text editor that supports Russian. Just put

x <- " some Russian text "

into a file, then use source() to read that file. Two outcomes are likely:

x will be defined to be a string holding Russian text, and it will display properly.

OR

it will be defined to be a string with lots of escapes or mis-displayed characters in it. In the latter case, the problem is that R is assuming a different encoding than your text editor. Calling l10n_info() will display information about the encoding R is expecting.
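
The test described above can be sketched roughly as follows. This is only an illustration under some assumptions: the sample word, the temporary file, and the variable names (txt, script, can_represent) are made up for the example, and the file is deliberately written as UTF-8 so that the encoding passed to source() is known to match.

```r
# What encoding does this R session assume?
info <- l10n_info()
print(info)

# Hypothetical sample text (the Russian word for "hello"), written with
# \u escapes so that this script itself is plain ASCII.
txt <- "\u043f\u0440\u0438\u0432\u0435\u0442"

# Write a one-line script as UTF-8 bytes, whatever the session's locale is.
script <- tempfile(fileext = ".R")
writeLines(paste0('x <- "', enc2utf8(txt), '"'), script, useBytes = TRUE)

# iconv() returns NA when the text cannot be converted to the native
# encoding (to = ""), e.g. in a plain "C" locale.
can_represent <- !is.na(iconv(txt, from = "UTF-8", to = ""))

if (can_represent) {
  # Tell source() explicitly which encoding the file was saved in.
  source(script, encoding = "UTF-8")
  print(x)
  print(x == txt)
} else {
  message("The session's native encoding cannot represent this text.")
}
```

If x prints as escapes or garbage even though the file is known to be UTF-8, that points at the session's locale rather than at the file.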

If none of the above helps you to get your code working, then you'll have to give details on exactly what you're doing to read the file, and exactly what is in the file.
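
One quick check that may be worth running before reporting back is to re-encode a few of the unreadable bytes under several candidate encodings and see which result looks like Russian. The sketch below uses the four bytes quoted in the question; the list of candidates is a guess, not something known about the actual file, and the encoding names that iconv() accepts vary by platform (see iconvlist()).

```r
# The four bytes quoted in the question, built from raw values.
garbled <- rawToChar(as.raw(c(0x82, 0xa2, 0xae, 0xef)))

# Assumed candidate encodings for Russian text; the real file may use another.
candidates <- c("CP866", "CP1251", "KOI8-R")

# Re-encode the same bytes to UTF-8 under each assumption.
decoded <- sapply(candidates, function(enc) {
  iconv(garbled, from = enc, to = "UTF-8")
})
print(decoded)
```

Read as CP866, these particular bytes come out as Cyrillic letters, which hints at (but does not prove) a DOS-era encoding. If one candidate looks right for the real data, the same name can be tried as the fileEncoding argument of read.table() or the encoding argument of file().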

Duncan Murdoch

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
