Hello Everyone,
Let's say I would like to do a linear regression on some Panel data without knowing if linear regression is the most appropriate technique to use. Before doing any analysis, is it better to have more variables, and extrapolate and interpolate missing observations for a few of them (say perhaps 25% of the variables need this out of a total of 30), or is it better to delete the years that contain these missing observations? (especially if the raw data has a mismatch of years with and without observations).
There are also control variables that allow for the testing of models for years and countries excluding the extrapolated observations if it matters. Please see the attached picture for an idea of what I'm talking about.
I'm new to cleaning panel data I just want some insight on what to look out for and what to prioritize for significant results that don't compromise truth. Thank you!
Related Posts with Cleaning for Optimal Modeling (ft. Panel Data)
Creating Attrition (Non-response) Weights for Balanced PanelHi everyone, I am working with panel data which consists of three waves (3, 4, and 5) as below. I wa…
Calculate days from hospital admission to a procedure using National Inpatient Sample (NIS)Hi all, My goal is to calculate time (days) from admission to the time a patient first received a s…
Variable not found when running loop even though it existsHello all, This is my first time asking a question on here, so please excuse me if I am describing …
Sorting ID’sHello, My data looks like this: Code: * Example generated by -dataex-. To install: ssc install da…
Assigning variable label when generating a new oneHi everyone, I am trying to generate a new variable named "new" using an existent variable named "ol…
Subscribe to:
Post Comments (Atom)
0 Response to Cleaning for Optimal Modeling (ft. Panel Data)
Post a Comment