Hi,
I am using two datasets with the aim to merge them according to the name of the place. The first dataset is the 2011 South African data, whereby the variable "sp_name" (which i've made all lower case) has unique individual names of small places in South Africa. The other data set has the variable "suburb" - these individual variable names are not entirely dissimilar to those of sp_name, however there are some cases of misspelling and mismatch. For instance, there are variables for sp_name such as "xx sp" and then for suburb it would just be "xx."
The other problem is that there is a mismatch in the number of observations for each, some missing etc.
I have attempted to understand the reclink and matchit command without much success in using them - I would get the return that "the variable cannot be found" although I did follow the same syntax.
Am I missing something with this problem that would allow for me to solve it conceptually?
Thank you
Related Posts with Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit
Mundlak device for the time dimensionDear all, This is a question not directly about stata application. I notice there are many topics a…
How to Properly Format a Loop CommandHi there, I am fairly new to Stata but especially new to loop commands. I am trying to replicate a l…
Using -reshape- to create male/female versions of key variablesHi Statalist. I would like to -reshape- some of my data from long to wide in order to apply the 'gen…
strange variablesI have a dataset with strange variables and I can't work with it. tried "destring", "split" in vain …
Effect of fixed-, random effects models or regression model with robust standard errors on the "regression model"Hello! If i run a regression with the xtreg command on panel data using different models such as fi…
Subscribe to:
Post Comments (Atom)
0 Response to Merging two datasets by different variable names (fuzzy match) - help with reclink and matchit
Post a Comment