Greetings,

I need to merge two databases containing information on
organizations, but the organization names are often not identical and there
is no common unique identifier.  Does anyone know a good way to calculate a
similarity measure between two names or, even better, is there a natural-language
matching function in an R package?  I searched for this but must not
know the right keywords.

As an example, here are some possible non-identical names:

Oxfam, Oxfam USA
American Services, Americam Services  -  (just misspelled)
Global Alliance for Action, Global Alliance for the Environment  -  (a non-match)
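
To make the goal concrete, here is a rough sketch of the kind of score I have in mind. It assumes that the generalized Levenshtein distance from adist() (in the utils package of recent R versions) is an acceptable starting point. The helper name_similarity() and the scaling by the longer name's length are my own made-up choices for illustration, not an established method.

```r
# Example name pairs taken from the list above.
pairs <- data.frame(
  a = c("Oxfam", "American Services", "Global Alliance for Action"),
  b = c("Oxfam USA", "Americam Services", "Global Alliance for the Environment"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: 1 - (edit distance / length of the longer name),
# so 1 means identical (ignoring case) and lower values mean less similar.
name_similarity <- function(x, y) {
  # adist() returns a matrix; compare each x[i] only with its partner y[i].
  d <- mapply(function(s, t) adist(s, t, ignore.case = TRUE)[1, 1], x, y)
  1 - unname(d) / pmax(nchar(x), nchar(y))
}

pairs$sim <- name_similarity(pairs$a, pairs$b)
print(pairs)
```

A score like this handles the misspelling well, but I doubt that a plain edit distance cleanly separates the first and third pairs, which is why I am hoping something better exists.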

Any suggestions are welcome!


-- 
View this message in context: 
http://old.nabble.com/merge-on-non-identical-names-tp26503346p26503346.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
