Good day all
I am using a stacked cross-sectional dataset, called the South African Post-Apartheid Labour Market Series (PALMS), from the years 1993 to 2017.
Many values are missing for the monthly earnings variable, so I have been asked to carry out a cell mean imputation for item non-response on earnings. As I understand it, this means calculating the mean of earnings within each cell, where a cell consists of everyone with the same education category (fewer than 12 years of schooling, exactly 12 years, or more than 12 years) and the same population group, and then assigning that cell mean to the members of the cell whose earnings are missing.
I do not know how to do this. I understand that I could loop over the respective years, but I am at a loss as to how to calculate the cell means and impute them.
Would anyone be able to help?
Regards
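The procedure described above can be sketched in a few lines. This is only a minimal illustration in plain Python, not a tested recipe for PALMS: the field names (`year`, `educ_years`, `popgroup`, `earnings`) are hypothetical, the means are unweighted, and defining cells within each year is an assumption that replaces the explicit loop over years. The same two steps (compute a group mean over the non-missing values, then fill the missing values in that group) carry over to any statistics package.

```python
from collections import defaultdict


def education_band(years):
    """Map years of schooling to '<12', '12' or '>12'; None if missing."""
    if years is None:
        return None
    if years < 12:
        return "<12"
    if years == 12:
        return "12"
    return ">12"


def cell_key(row):
    """Return the (year, education band, population group) cell, or None
    when either classifying variable is missing."""
    band = education_band(row["educ_years"])
    if band is None or row["popgroup"] is None:
        return None
    # Assumption: cells are formed within year. Drop row["year"] from the
    # tuple to pool the cell means across all years instead.
    return (row["year"], band, row["popgroup"])


def impute_cell_means(rows):
    """Return copies of rows with missing earnings replaced by the cell mean.

    Each row is a dict with the hypothetical keys 'year', 'educ_years',
    'popgroup' and 'earnings' (None marks a missing value).
    """
    totals = defaultdict(float)
    counts = defaultdict(int)

    # Pass 1: accumulate the sum and count of observed earnings per cell.
    for row in rows:
        key = cell_key(row)
        if key is not None and row["earnings"] is not None:
            totals[key] += row["earnings"]
            counts[key] += 1

    # Pass 2: fill missing earnings where the cell has at least one
    # observed value, and flag every imputed record.
    result = []
    for row in rows:
        new_row = dict(row)
        new_row["imputed"] = False
        key = cell_key(row)
        if row["earnings"] is None and key is not None and counts[key] > 0:
            new_row["earnings"] = totals[key] / counts[key]
            new_row["imputed"] = True
        result.append(new_row)
    return result


# Made-up records purely for illustration.
sample = [
    {"year": 1993, "educ_years": 10, "popgroup": "A", "earnings": 1000.0},
    {"year": 1993, "educ_years": 8, "popgroup": "A", "earnings": 2000.0},
    {"year": 1993, "educ_years": 11, "popgroup": "A", "earnings": None},
    {"year": 1993, "educ_years": 12, "popgroup": "A", "earnings": None},
]
filled = impute_cell_means(sample)
```

Keeping an `imputed` flag makes it possible to compare results with and without the imputed records later. Records in a cell with no observed earnings stay missing rather than receiving an arbitrary value.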