I have two datasets describing the same 80 schools.
Some variables are similar -- for example, both lists include the schools' enrollment and % African American -- but there is no common ID or other variable that matches the two lists perfectly.
Is there a command that will find the best way to link the two datasets?
If not, I've done much of the work. I've made an 80x80=6400 row dataset that gives the Mahalanobis distance between every possible combination of a row from the first dataset with a row from the second. Now how do I optimally sort and deduplicate 6400 rows to get the best 80 matches, with no duplicates from either list?
Related Posts with Record linkage with quantitative variables
Time DummyWhat is the command to test the before and after effect of a phenomenon, example price rise . The ye…
return list in "ineqdeco" command (user-wrriten)Hi, I am suffering from a problem. I want get the results when I run "ineqdeco" command, but I can't…
How do I get the direct, indirect, and total marginal effects from a probit model with spatial lags?I have searched for Stata commands or routines for spatial probit regression but I could not find on…
dbs: Stata command for computing double bootstrap confidence intervalsDear all, I am happy to announce a new Stata command, dbs, which allows the computation of double bo…
6 groups in one graphHello everyone, I want to have a graph with 6 groups in it. I am able to produce a graph with 6 sm…
Subscribe to:
Post Comments (Atom)
0 Response to Record linkage with quantitative variables
Post a Comment