Hi,
I have two really large datasets (5,00,000 observations each) and I have been using the command "matchit id allNames using "xyz.dta", idusing(familyid) txtusing(allNamesFamily)" to match names between these two datasets.
However, matchit is taking a really really long time to carry out the fuzzy match (almost 24 hours). I have decided to run the same command but on smaller groups now however I am not sure how to create a loop function for it.
Essentially,
I want STATA to create district-level groups, carry out the above matchit command for each group, and save all the results together. The idea is that matchit only has to look for searches within each of these district-level sub-groups and not the whole dataset.
Can anyone help me with writing such a loop?
Related Posts with How to create a loop function to implement fuzzy matching on a large dataset?
Splitting messy string varsHello, I am hoping to split up these string vars and generate binary new vars based on whether they …
duplicates report function with more details and recommended ressoures for learning STATAHi, I am learning STATA and already acquired some basic knowledge of data processing, transformatio…
where does collect store its matrices?Stata 17.0 has the collect command. I wonder where does it store its intermediate and final output. …
Help with test for trend in proportion using survey weightHi Statalist members, I am having difficulty with testing for trends in proportion using survey wei…
Help with mergingHello, I have been working with a data set that uses NHANES data from 2003-2012. I wanted to add dat…
Subscribe to:
Post Comments (Atom)
0 Response to How to create a loop function to implement fuzzy matching on a large dataset?
Post a Comment