I will be grateful if anyone here could help me with panel data regression using STATA. I am currently conducting Fixed Effect(FE) panel data regression using longitudinal survey data including two time-series ( wave1 in 2011 and wave2 in 2013). Theoretically, the number of observations in the regression should be an even number, as I only have 2 time periods(2 waves) in my regression. However, the number of observations that participated in the FE regression is an odd number (n=10625, see the screenshot below).
Array
I checked the duplicates in both waves in order to fix the odd number problem in my regression. However, I found only half of my sample have the observations in both two waves (n=5176, see the attached screenshots below). In the screenshot below, the observations that exist in both waves were indicated by 1, while 0 represents the unique observations that only exists in one wave.
Array
I am a bit confused that why STATA could still run the FE regression with missing data in a whole wave. After dropping the unique observations that only exist in one wave, I run the FE again and found STATA comes out with the exact same result except for the number of observations and groups in the regression (see the screenshots of the result below, n=5176).
Array
I am quite confused about how STATA works with the missing data. Should I drop the unique observations and keep only the observations in both waves? I am worried that I might lose a large number of observations and cause bias in my regression. But if I keep the unique observations, the analysis using longitudinal data would seem to be meaningless. I'm wondering if anyone knows why this problem happens /how to deal with this problem if anyone has done this before? Many thanks for that.
Related Posts with How STATA Works With Missing Data in Panel Data Regression
Graph with varying scales using by() optionI have data on emissions changes across states over time. The emissions are normalized as a percent …
Replacing observations appearing multiple times on the same variable on the same participant. Longitudinal study.Hi again. I have a new problem, and I'll try to explain it the best can. First, som background info…
Contradicting significance in complete pooled vs partial-poolingHi all, I'm working on nested structure data with individuals nested in countries (I may also defin…
"_Effect_Size not found" error when performing "netleague" command in Stata for network meta-analysisI have been learning to use Stata I/C 16 to perform network meta-analysis using the "mvmeta" and "ne…
Weigthed estimations with XTMG command (CCEMG and AMG)Hi everyone, I am estimating a regression with panel data for the model WfpPcLog ~ GdpPcLog + Gdp2P…
Subscribe to:
Post Comments (Atom)
0 Response to How STATA Works With Missing Data in Panel Data Regression
Post a Comment