Hi,
I have two very large datasets (500,000 observations each), and I have been using the command "matchit id allNames using "xyz.dta", idusing(familyid) txtusing(allNamesFamily)" to match names between them.
However, matchit takes a very long time to carry out the fuzzy match (almost 24 hours). I have decided to run the same command on smaller groups instead, but I am not sure how to write a loop for it.
Essentially, I want Stata to create district-level groups, run the above matchit command within each group, and save all the results together. The idea is that matchit then only has to search for matches within each district-level subgroup rather than across the whole dataset.
Can anyone help me with writing such a loop?
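For reference, a minimal sketch of the kind of loop being asked about. It rests on several assumptions that are not stated above: both datasets contain a numeric variable called district with the same coding, the master data is saved as "master.dta", and "xyz_district.dta" is a scratch file that may be overwritten in the working directory. All three names are hypothetical. It also assumes matchit behaves as usual and replaces the data in memory with the matched pairs.

```stata
* Sketch only. Hypothetical names: district, master.dta, xyz_district.dta.
* matchit is user-written: ssc install matchit (it also requires freqindex).

use "master.dta", clear
levelsof district, local(districts)    // list of district codes in the master data

tempfile results
local first = 1

foreach d of local districts {
    * Keep only this district's records from the using dataset
    use "xyz.dta", clear
    keep if district == `d'
    if _N == 0 continue                // no candidates in this district, skip it
    save "xyz_district.dta", replace

    * Keep only this district's records from the master dataset
    use "master.dta", clear
    keep if district == `d'

    * Same matchit call as above, restricted to one district
    matchit id allNames using "xyz_district.dta", idusing(familyid) txtusing(allNamesFamily)
    gen district = `d'                 // record which district the pairs came from

    * Accumulate the matched pairs across districts
    if `first' {
        save `results', replace
        local first = 0
    }
    else {
        append using `results'
        save `results', replace
    }
}

use `results', clear                   // all district-level matches stacked together
```

If district is a string variable, the comparisons would need to be written as district == "`d'" and the gen line changed accordingly. Reloading the full files on every pass is simple but not the fastest option; saving one file per district up front would avoid it.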