Re: [R] Matching names with non-English characters

2013-05-13 Thread Duncan Murdoch
On 13/05/2013 12:05 PM, Spencer Graves wrote: Hello: How can one match names containing non-English characters that appear differently in different but related data files? For example, I have data on Raúl Grijalva, who represents the third district of Arizona in the US House of Represe

Re: [R] Matching names with non-English characters

2013-05-13 Thread Jeff Newmiller
Build a lookup table for your data. I think it is a fools errand to think that you can automatically "normalize" arbitrary Unicode characters to an ASCII form that everyone will agree on. BTW: To avoid propagating open joins your data should probably have some kind of id for the term those Repr

[R] Matching names with non-English characters

2013-05-13 Thread Spencer Graves
Hello: How can one match names containing non-English characters that appear differently in different but related data files? For example, I have data on Raúl Grijalva, who represents the third district of Arizona in the US House of Representatives. This first name appears as "Raúl"