I have two datasets describing the same 80 schools.
Some variables are similar -- for example, both lists include the schools' enrollment and % African American -- but there is no common ID or other variable that matches the two lists perfectly.
Is there a command that will find the best way to link the two datasets?
If not, I've done much of the work. I've made an 80x80=6400 row dataset that gives the Mahalanobis distance between every possible combination of a row from the first dataset with a row from the second. Now how do I optimally sort and deduplicate 6400 rows to get the best 80 matches, with no duplicates from either list?
Related Posts with Record linkage with quantitative variables
Average over a period of timeHi All, I have been using multiple real-time devices to collect air measurements., Now that I have …
Heckman modelHello every one first: i know two ways to use Heckman test, (1) by generate dummy variable as follow…
mi impute chained error msg: right-hand variables or weights missing values--but they are notHello, I'm using the mi impute chained command on Stata SE 15.1. After encountering the perfect pre…
merge problem: observations are matched but I get 'no observations' for variables of interestGreetings, I'm running Stata 15.1 on OSX. I'm working with a cumulative longitudinal dataset (the A…
🐛 Stata crashes on command executionThe following example demonstrates a sequence of commands that when executed crash Stata without pos…
Subscribe to:
Post Comments (Atom)
0 Response to Record linkage with quantitative variables
Post a Comment