Hello,
I struggle to find the right method for what I want to do using two household surveys. I have two datasets:
1) X dataset with socio-econ info (A1) and Z info
2) Y dataset with socio-econ info (A2)
The Y dataset does not have Z info and this is what I want to impute based on the X dataset. The imputation/matching will be based on socio-econ info (A1 and A2). Which method is the best? I looked into MI with MAR options where they use mixed-method multiple imputations but this method is based on the fact that you impute missing values from the SAME population. I'm not so sure if I can use this method with my data.
If my example is too abstract then consider this: I have two household survey datasets. X has expenditures on food, clothing, and house fuels but Y dataset does not have it so I need to impute this information. This I can do because I have information related to income, household size, appliances ownership, etc in both datasets. So if the marginal distribution in both datasets X and Y is similar for these socio-econ characteristics I can then impute the expenditure data.
I would greatly appreciate any help - even naming method or tools that are available in STATA will be super helpful!
Cheers,
Marta
Related Posts with methodological question: matching/imputation based on two datasets
Interpreting results of a model for nonnegative, skewed dependent variablesHi there, I am I am working with data where the outcome is continuous and we have several predictor …
Reporting duplicates on two variables that are not duplicates on a third variableHi all, I am trying to identify cases that are duplicates on two variables "agencyname" and "statef…
looping through data filesHi, I'm using Stata 14 on Windows. Currently I use this syntax for each country SEPARATELY (here on…
Effect of variation in correlation and the in number of repeated measurements on power analysis with the "power repeated" commandDear Statalisters, I'm using Stata 17.0. By looking at this paper: https://www.jstor.org/stable/pd…
xtdidreAfter executing xtdidregress, I do estat trendplots. It gives me "treatment assignment times vary; n…
Subscribe to:
Post Comments (Atom)
0 Response to methodological question: matching/imputation based on two datasets
Post a Comment