I have two datasets. One contains families of exactly 2 members; the other contains families of varying sizes. The names often contain minor spelling mistakes, so exact matches will frequently fail.
How can I match each 2-member family in the first dataset with the corresponding n-member family in the second dataset? Both names from the 2-member family must appear in the n-member family, although they may not match exactly because of the spelling mistakes.
In some sense, I am trying to find a way to tell Stata to:
“Match if 4 words in the 2-member and n-member families are 80% similar”. The 4 words are the 2 first names and 2 last names of each 2-member family in the first dataset, and the 80% threshold allows for spelling mistakes.
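To make the rule concrete, here is a rough sketch of the logic in Python rather than Stata. It is only an illustration under some assumptions: each family is already split into a flat list of name words, similarity is measured with the standard library's `difflib.SequenceMatcher` ratio (one possible measure among many), and the function names and sample names are made up for the example.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Return a 0-1 similarity score for two names, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def family_matches(pair_words, family_words, threshold=0.8):
    """True if every word of the 2-member family has a distinct,
    sufficiently similar word in the n-member family."""
    unused = list(family_words)
    for word in pair_words:
        # Pick the closest still-unused word from the larger family.
        best = max(unused, key=lambda w: similarity(word, w), default=None)
        if best is None or similarity(word, best) < threshold:
            return False
        # Each word in the larger family may be used only once.
        unused.remove(best)
    return True


# Hypothetical sample data: 2 first names and 2 last names (4 words)
pair = ["John", "Smith", "Mary", "Smith"]
family = ["Jon", "Smith", "Marry", "Smyth", "Peter", "Smith"]
print(family_matches(pair, family))
```

The matching here is greedy (each word takes its closest unused counterpart), which keeps the sketch short but is not guaranteed to find the best overall assignment.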
Is there a way to do this?