Hello Everyone,
Let's say I would like to do a linear regression on some Panel data without knowing if linear regression is the most appropriate technique to use. Before doing any analysis, is it better to have more variables, and extrapolate and interpolate missing observations for a few of them (say perhaps 25% of the variables need this out of a total of 30), or is it better to delete the years that contain these missing observations? (especially if the raw data has a mismatch of years with and without observations).
There are also control variables that allow for the testing of models for years and countries excluding the extrapolated observations if it matters. Please see the attached picture for an idea of what I'm talking about.
I'm new to cleaning panel data I just want some insight on what to look out for and what to prioritize for significant results that don't compromise truth. Thank you!
Related Posts with Cleaning for Optimal Modeling (ft. Panel Data)
xtgls options in StataHi, Does someone know when should we use corr(ar1) or corr(psar1)? Any example from literature? I t…
Survival DataDear all, I am trying to restructure/compress my dataset because it’s currently too big to do anythi…
Balance Table and RandomizationHi, I am working with data of a random experiment. How ever, in my case of study, in a moment I lose…
Keep if or Drop IfHello, I am trying to select a couple of cases in a dataset - and perform a set of operations/chang…
VIF test of multi-collinearityDear list members, after running some -ivregress2 2sls- and -ivprobit- regressions, I am unsure whi…
Subscribe to:
Post Comments (Atom)
0 Response to Cleaning for Optimal Modeling (ft. Panel Data)
Post a Comment