Hi everyone,
I am currently working on looking at the impact of intellectual property rights on the Indian pharmaceutical industry. I have a panel data set (secondary data from CMIE) of 350 firms across 28 time periods. However, I am facing a big problem with regard to missing data. Almost all the variables I need to consider in the model (Eg: R&D=f(pat, exports, imported tech etc) have missing data ranging from 10% to 30%. How best would you suggest I handle this problem before undertaking any analysis? List wise deletion in Stata reduces the number of firms to 68, drastically reducing the sample size.
Is multiple imputation of data when all variables have some missing values a possibility in Stata?
Thank you in advance!
0 Response to Problem of handling missing data
Post a Comment