Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit

Hi,

I am using two datasets with the aim to merge them according to the name of the place. The first dataset is the 2011 South African data, whereby the variable "sp_name" (which i've made all lower case) has unique individual names of small places in South Africa. The other data set has the variable "suburb" - these individual variable names are not entirely dissimilar to those of sp_name, however there are some cases of misspelling and mismatch. For instance, there are variables for sp_name such as "xx sp" and then for suburb it would just be "xx."
The other problem is that there is a mismatch in the number of observations for each, some missing etc.
I have attempted to understand the reclink and matchit command without much success in using them - I would get the return that "the variable cannot be found" although I did follow the same syntax.
Am I missing something with this problem that would allow for me to solve it conceptually?

Thank you

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit
Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit

0 Response to Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit

Related Posts with Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit

0 Response to Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit
Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit