I wanted to merge two data sets. There are 99 variables and 315,717 observations (size: 56,513,343) in 1st data set (master). It (MORG data (from Current Population Survey) has the basic demographic variables (age, sex, race, and marital status etc.) The 2nd data set (fatal occupational injury data from the BLS) set has 2 variables (injury rate and occupation code) and 162 observations (size: 1,296). I have sorted both data sets by the variable, occ2012 (occupation code variable) before merging them:
I am not sure which merge command to use, merge 1:1, 1:m or m:1.
I have used m:1 and got this:
use "C:\Users\hmridha\Documents\Fall2018\paper 4\data ETC\sortedmorg13.dta"
. merge m:1 occ2012 using "C:\Users\hmridha\Documents\Fall2018\paper 4\data ETC\foic_new\sortedfoic13.dta"
(note: variable occ2012 was int, now float to accommodate using data's values)
Result # of obs.
-----------------------------------------
not matched 312,407
from master 312,255 (_merge==1)
from using 152 (_merge==2)
matched 3,462 (_merge==3)
-----------------------------------------
Does this merging look all right? I would appreciate any help.
Related Posts with Merging two data sets
Is there a command for quickly splitting a categorical variable into multiple binary variables?Is there a command to split a categorical variable into binary variables? For example, splitting a "…
What does _rc 111 mean in merge?I have a merge statement that returns 111 return code, but appears to have worked. Code: merge 1:m…
margins at percentilesHi Statalists, I have code like this: Code: regress y x1 x2 x3 egen p10 = pctile(x1), p(10) egen …
Interpolating missing dataCan we interpolate missing data for central bank policy rate by using leading rate or any other econ…
FE with four-way error-componentsHello everyone, I am estimating a DiD (difference in difference) with Least-squares dummy-variables…
Subscribe to:
Post Comments (Atom)
0 Response to Merging two data sets
Post a Comment