Hi everyone,
I am currently working on looking at the impact of intellectual property rights on the Indian pharmaceutical industry. I have a panel data set (secondary data from CMIE) of 350 firms across 28 time periods. However, I am facing a big problem with regard to missing data. Almost all the variables I need to consider in the model (Eg: R&D=f(pat, exports, imported tech etc) have missing data ranging from 10% to 30%. How best would you suggest I handle this problem before undertaking any analysis? List wise deletion in Stata reduces the number of firms to 68, drastically reducing the sample size.
Is multiple imputation of data when all variables have some missing values a possibility in Stata?
Thank you in advance!
Related Posts with Problem of handling missing data
Confidence Intervals / Errr plots with binscatterI am trying to include confidence intervals / errors bars in a plot for two variables using -binscat…
Calculating difference in differences, percentages and treatment effectsHi everyone, I'm working on a project for my economic development course and am having difficulty d…
Count dates within previous 31 day range for each date using forvaluesDear Stata colleagues, I created a loop in order to count the number of dates following 3 condition…
Mata debugging strategiesMight anyone suggest helpful references on strategies for Mata debugging? …
Finding the number of firms for which a given worker has worked - Matched employer-employee datasetDear all, I am working with a matched employer-employee dataset from Brazil in which each observati…
Subscribe to:
Post Comments (Atom)
0 Response to Problem of handling missing data
Post a Comment