Since it is a university assignment the normal approach (as i have been taught, and is the recommendations in https://www.iuj.ac.jp/faculty/kucc62...blq5Qmk7KvdJLg) would be to start of with a simple model like a Pooled OLS, and then if that isn't sufficient, or the assumptions of the model don't seem to hold up, then you move on to fixed or random effects models. Gladly correct me if this approach isn't optimal.
My first issue when doing the Pooled OLS, is figuring out if it is actually done correctly (As i have seen different approaches from different sources). From what i can tell you do this by running clustered standard errors.
Code:
reg Covid19_cases x1 x2 x3 Country, vce(cluster Country)
Question 2. How do i test the assumptions of heteroskedasticity and autocorrelation when using clustered standard errors, as this seems to make it impossible to run a Breusch-Pagan test.
Code:
. hettest hettest not appropriate after robust cluster() r(498);
Furthermore, i know that -xtreg usually outperforms -reg (with clustered standard errors) when it comes to panel data regression.
So my Question 3 (See output from Pooled OLS and Random effects below) is how do i based on the stata output determine whether i should use Pooled OLS, fixed or random effects model. (As almost all my variables are static, i know that i'll probably end up with a -re effects model. I just simply haven't been able to statistically argue for this point of view, as i can't even test for things like heteroskedasticity and autocorrelation)
output from Pooled OLS:
Code:
Linear regression Number of obs = 4,592 F(19, 41) = 303.69 Prob > F = 0.0000 R-squared = 0.7294 Root MSE = 1242 (Std. Err. adjusted for 42 clusters in Country) ----------------------------------------------------------------------------------------------- | Robust Covid19_cases | Coef. Std. Err. t P>|t| [95% Conf. Interval] ------------------------------+---------------------------------------------------------------- Ages0_14Pct | 8123.495 11156.99 0.73 0.471 -14408.5 30655.49 Ages65_99Pct | 3038.334 11320.49 0.27 0.790 -19823.86 25900.53 Ages15_64Pct | 7851.408 11239.37 0.70 0.489 -14846.96 30549.78 Covid19_deaths | 9.630924 1.715408 5.61 0.000 6.166587 13.09526 CrimeIndex | -8.226521 5.282897 -1.56 0.127 -18.89555 2.442507 DAI_B_index | 1999.881 1276.736 1.57 0.125 -578.5393 4578.301 DAI_G_index | 290.8351 454.5301 0.64 0.526 -627.107 1208.777 DAI_P_index | 3.885746 690.839 0.01 0.996 -1391.292 1399.064 Gdp2018 | .1692471 .0260085 6.51 0.000 .1167219 .2217724 GdpAgriculturalPct | 1772.46 2558.133 0.69 0.492 -3393.794 6938.715 GdpIndustrialPct | -35.28193 2158.263 -0.02 0.987 -4393.983 4323.419 GdpServicePct | 150.5583 2216.818 0.07 0.946 -4326.396 4627.512 InternetUsage2014Pct | -283.9534 701.2951 -0.40 0.688 -1700.248 1132.341 popData2018 | -9.46e-07 4.60e-07 -2.06 0.046 -1.87e-06 -1.64e-08 pop_AnnualGrowthPct_2010_2018 | -15230.27 11371.27 -1.34 0.188 -38195.02 7734.475 pop_density18 | -.3554776 .4322918 -0.82 0.416 -1.228509 .5175533 SocialMobilityIndex | -9.681271 21.7794 -0.44 0.659 -53.66567 34.30313 StringencyIndex | 3.398476 1.255605 2.71 0.010 .8627302 5.934222 Country | .3061796 1.2523 0.24 0.808 -2.222892 2.835251 _cons | -7908.159 12120.52 -0.65 0.518 -32386.04 16569.72 -----------------------------------------------------------------------------------------------
output from Random effects:
Code:
xtset Country Date
Code:
Random-effects GLS regression Number of obs = 4,592 Group variable: Country Number of groups = 42 R-sq: Obs per group: within = 0.6600 min = 51 between = 0.9450 avg = 109.3 overall = 0.7293 max = 113 Wald chi2(18) = 9283.67 corr(u_i, X) = 0 (assumed) Prob > chi2 = 0.0000 ----------------------------------------------------------------------------------------------- Covid19_cases | Coef. Std. Err. z P>|z| [95% Conf. Interval] ------------------------------+---------------------------------------------------------------- Ages0_14Pct | 7719.581 17073.57 0.45 0.651 -25744 41183.17 Ages65_99Pct | 2600.686 16826.48 0.15 0.877 -30378.61 35579.98 Ages15_64Pct | 7477.483 17549.24 0.43 0.670 -26918.39 41873.36 Covid19_deaths | 9.728414 .1100511 88.40 0.000 9.512718 9.94411 CrimeIndex | -8.30544 6.737789 -1.23 0.218 -21.51126 4.900383 DAI_B_index | 1956.444 1257.037 1.56 0.120 -507.3033 4420.191 DAI_G_index | 308.9836 463.2871 0.67 0.505 -599.0424 1217.01 DAI_P_index | 64.28808 1023.292 0.06 0.950 -1941.327 2069.903 Gdp2018 | .1685496 .0209757 8.04 0.000 .127438 .2096611 GdpAgriculturalPct | 1816.435 4797.581 0.38 0.705 -7586.651 11219.52 GdpIndustrialPct | -124.5615 4119.166 -0.03 0.976 -8197.979 7948.856 GdpServicePct | 135.0255 4126.775 0.03 0.974 -7953.304 8223.356 InternetUsage2014Pct | -392.6615 1069.255 -0.37 0.713 -2488.362 1703.039 popData2018 | -9.59e-07 3.03e-07 -3.16 0.002 -1.55e-06 -3.64e-07 pop_AnnualGrowthPct_2010_2018 | -15717.27 16056.61 -0.98 0.328 -47187.65 15753.11 pop_density18 | -.3674106 .5401536 -0.68 0.496 -1.426092 .691271 SocialMobilityIndex | -8.118353 21.31227 -0.38 0.703 -49.88963 33.65293 StringencyIndex | 3.613653 .5180906 6.97 0.000 2.598214 4.629092 _cons | -7511.384 18162.4 -0.41 0.679 -43109.03 28086.26 ------------------------------+---------------------------------------------------------------- sigma_u | 319.2514 sigma_e | 1214.6213 rho | .06462069 (fraction of variance due to u_i) -----------------------------------------------------------------------------------------------
Data description:
21 variables, and 4592 observations. (unbalanced dataset)
Variable | Description | |
Date | Time indicator (In days) | |
StringencyIndex | Index measuring the goverment response to Covid19. 100 being the most severe response, and 0 being the loosest response. |
|
Covid19_cases | Dependent variable Measuring the number of recorded covid19 cases |
|
Covid19_deaths | Measuring the number of recording deaths caused by covid19 | |
popData2018 | 2018 country population data | |
DAI_index | Digital adoption index Measuring a countries digital adoption across three dimensions of the economy: people, government, and business |
|
DAI_B_index | Measuring a countries digital adoption across business | |
DAI_P_index | Measuring a countries digital adoption across people | |
DAI_G_index | Measuring a countries digital adoption across government | |
pop_AnnualGrowthPct_2010_2018 | Measuring a countries annual growth in population from 2010 to 2018 in pct. | |
Ages0_14Pct | Measuring the pct. of a countries population who are between 0 and 14 years of age. | |
Ages15_64Pct | Measuring the pct. of a countries population who are between 15 and 64 years of age. | |
Ages65_99Pct | Measuring the pct. of a countries population who are between 65 and 99 years of age. | |
Ages0_99Pct | Measuring the pct. of a countries population who are between 0 and 99 years of age. | |
CrimeIndex | Index measuring crime rates by country. 100 being the highest crimes rates and 0 being the lowest |
|
SocialMobilityIndex | Index measuring social mobility by country 100 being the highest social mobility and 0 being the lowest |
|
Gdp2018 | Country GDP by 2018 numbers | |
GdpAgriculturalPct | Pct. of a countries GDP that comes from the agriculture sector | |
GdpIndustrialPct | Pct. of a countries GDP that comes from the industrial sector | |
GdpServicePct | Pct. of a countries GDP that comes from the service sector | |
InternetUsage2014Pct | % of a countries population that uses the internet, by 2014 numbers | |
Country | Entity indicator | |
Continent | Continent | |
pop_density18 | Population density by country by 2018 numbers |
Best regards, Walther Larsen
0 Response to Pooled OLS, fixed & random effects: Panel Data
Post a Comment