Re: [R] reading and frequency analysis of Spanish text

2009-08-05 Thread Sam Thomas
-Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Michael Friendly Sent: Wednesday, August 05, 2009 2:19 PM To: R-Help Subject: [R] reading and frequency analysis of Spanish text For an historical paper I'm working on, I have so

Re: [R] reading and frequency analysis of Spanish text

2009-08-05 Thread David Winsemius
When I open that link in OpenOffice.org Writer and then save in "Text encoded" format with "Unicode" encoding, the diacriticals (is that the correct font-ish term?) seem to remain intact wehn re-opended. When I read that file in, not with scan() but with readLines(), here is what I get for

[R] reading and frequency analysis of Spanish text

2009-08-05 Thread Michael Friendly
For an historical paper I'm working on, I have some Spanish plaintext, presently in the form of a Word .doc file, http://euclid.psych.yorku.ca/SCS/Gallery/images/Private/Langren/Verdadera-spanish-stripped.doc and also some ciphered text from the same original source. The ultimate goal is to u