BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

Record linkage with quantitative variables
Record linkage with quantitative variables

I have two datasets describing the same 80 schools.

Some variables are similar -- for example, both lists include the schools' enrollment and % African American -- but there is no common ID or other variable that matches the two lists perfectly.

Is there a command that will find the best way to link the two datasets?

If not, I've done much of the work. I've made an 80x80=6400 row dataset that gives the Mahalanobis distance between every possible combination of a row from the first dataset with a row from the second. Now how do I optimally sort and deduplicate 6400 rows to get the best 80 matches, with no duplicates from either list?

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Record linkage with quantitative variables
Record linkage with quantitative variables

0 Response to Record linkage with quantitative variables

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Record linkage with quantitative variables Record linkage with quantitative variables

0 Response to Record linkage with quantitative variables