I have two datasets describing the same 80 schools.
Some variables are similar -- for example, both lists include the schools' enrollment and % African American -- but there is no common ID or other variable that matches the two lists perfectly.
Is there a command that will find the best way to link the two datasets?
If not, I've done much of the work. I've made an 80x80=6400 row dataset that gives the Mahalanobis distance between every possible combination of a row from the first dataset with a row from the second. Now how do I optimally sort and deduplicate 6400 rows to get the best 80 matches, with no duplicates from either list?
Related Posts with Record linkage with quantitative variables
How to calculate OLS R-square from FE model estimationHi all, The FE model estimates R-squares corresponding to: within, between and overall equations. H…
Time trends in panel dataHi master, I have the following regression of panel data using fixed effects, I was wondering after…
Dropping duplicated pairs of two variablesI have a dataset which contains three variables: CityA, CityB and Dstance. See the following example…
convert address to coordinatesHi there - I have a list of addresses (street number, street name, county, city, zipcode) in the U.…
Subtract number and character separately from a string variableI have a string variable (Size) that contains numbers and characters. I want to subtract number and …
Subscribe to:
Post Comments (Atom)
0 Response to Record linkage with quantitative variables
Post a Comment