Hello Everyone,
Let's say I would like to do a linear regression on some Panel data without knowing if linear regression is the most appropriate technique to use. Before doing any analysis, is it better to have more variables, and extrapolate and interpolate missing observations for a few of them (say perhaps 25% of the variables need this out of a total of 30), or is it better to delete the years that contain these missing observations? (especially if the raw data has a mismatch of years with and without observations).
There are also control variables that allow for the testing of models for years and countries excluding the extrapolated observations if it matters. Please see the attached picture for an idea of what I'm talking about.
I'm new to cleaning panel data I just want some insight on what to look out for and what to prioritize for significant results that don't compromise truth. Thank you!
Related Posts with Cleaning for Optimal Modeling (ft. Panel Data)
Error r(603) has suddenly started to occur in Mac OSRunning Stata 17 on Mac OS 11.6.6. Until two days ago (1/6/22) I was able to run the following comm…
Logistic RegressionI am fairly new to Statistics. I have a large datatset where N=50,000. I have a dichotomous outcome …
short term and long term analysisHello everyone, I want to test the short-term and long-term impact of a certain independent variabl…
A Solution for Publication Quality Regression Tables (from STATA to .DOC)Dear friends, I often use Latex to generate publication-quality tables and figures and the STATA-La…
Reshape vs stack helpHello all: I am trying to reshape a set of variables on the second duplicated row. c11-17 correspon…
Subscribe to:
Post Comments (Atom)
0 Response to Cleaning for Optimal Modeling (ft. Panel Data)
Post a Comment