Hi,
I have two really large datasets (5,00,000 observations each) and I have been using the command "matchit id allNames using "xyz.dta", idusing(familyid) txtusing(allNamesFamily)" to match names between these two datasets.
However, matchit is taking a really really long time to carry out the fuzzy match (almost 24 hours). I have decided to run the same command but on smaller groups now however I am not sure how to create a loop function for it.
Essentially,
I want STATA to create district-level groups, carry out the above matchit command for each group, and save all the results together. The idea is that matchit only has to look for searches within each of these district-level sub-groups and not the whole dataset.
Can anyone help me with writing such a loop?
Related Posts with How to create a loop function to implement fuzzy matching on a large dataset?
Thesis regressionGoodmorning all, For my masters thesis I have to do some cross-sectional regressions, using the mod…
regression panel data: OLS / GEEhello! I have a panel dataset on which I would like to perform a regression. my dependent variable …
looping over data filesHello Users, I have data of banks by id numbers. What I want to do is: (1) to merge 4 tables of one …
ARIMA Modelling with Binary Independent VariableI am trying to fit an ARIMA model to my intervention analysis but when I use the syntax and add both…
scatterplot with text labels for y and x?Dear All, I'd like to have a figure like below (need not to be exactly the same). Array I make up …
Subscribe to:
Post Comments (Atom)
0 Response to How to create a loop function to implement fuzzy matching on a large dataset?
Post a Comment