On 13/05/2013 12:05 PM, Spencer Graves wrote:
Hello:
How can one match names containing non-English characters that
appear differently in different but related data files? For example, I
have data on Raúl Grijalva, who represents the third district of Arizona
in the US House of Represe
Build a lookup table for your data.
I think it is a fools errand to think that you can automatically "normalize"
arbitrary Unicode characters to an ASCII form that everyone will agree on.
BTW: To avoid propagating open joins your data should probably have some kind
of id for the term those Repr
Hello:
How can one match names containing non-English characters that
appear differently in different but related data files? For example, I
have data on Raúl Grijalva, who represents the third district of Arizona
in the US House of Representatives. This first name appears as "Raúl"
3 matches
Mail list logo