I am trying to estimate a dynamic panel data on gdp growth rates with large T, by far greater than N. The estimation method is ivreghdfe a linear Model with Multi-Way Fixed Effects as in Correia (2016). Time and fixed effects clustering Driscoll-Kraay AR(1) s.e partialling-out exogenous regressors, but when I reach the point of treating specific groups, estimating only for those groups I get the error
Code:
Error: number of observations must be greater than number of instruments including constant. insufficient observations
My model is specified as
Code:
(yi,t − yi,t−1 )= (α − 1)yi,t−1 + β1yi,t−1 + β2T yi,t−1 + β3T 2yi,t−1 + yi, t+Σ1Nβ4+nyi,t + +β5Ζi,t+Di,t+μt+ηi,+εi,t
My code is for abbreviation is
Code:
ivreghdfe Dgdpgroth other RHS variables list of dummies and cattegorical year c.ind_*##c.ind_* (humancapital=ky) if imf_income2==1 & year >=1990, absorb(id) dkraay(1) partial(i.year)
All independent variable are endogenous variables. Dummy variables capture events, and categorical variables, capturing classification or duration of an event in years. Indicator are nine types of continuous indicator variables, not binary, allowed to interact with them. I am trying to see the effects of them on the GDP growth and the other macro variables. Their total number is large.
The general code I run and looks fine like being
Code:
ivreghdfe growthgdp var1 var2 var2 ...... var_n dummy1 dummy2....dummy_n year c.ind_*##c.ind_* (humancapital=ky), absorb(id) dkraay(1) partial(i.year)
but when I look into the specific group with limited group members and limit the years windows, for some groups only, the above-mentioned error appears
Code:
ivreghdfe growthgdp var1 var2 var2 ...... var_n dummy1 dummy2....dummy_n year c.ind_*##c.ind_* (humancapital=ky) if group2==1 & year >=1990, absorb(id) dkraay(1) partial(i.year)
I have three questions :
First, is if there is a way or trick to deal with this error? The obvious answer is to reduce the number of instruments, but that will create me a problem in comparing the results with the full scale model and the other groups where no such problem occur. I mean, can I get an estimation for those groups comparable to the other groups and the general model using somehow multiple fixed effects with clustering and partial out with dkraay AR(1) errors. Is there a way to approach this?
My second question if the syntax is correct as I would like to instrument for capital to output ratio (ky). I am interested in coefficients for human capital, but not sure if the syntax is correct. Could I just instrument capital to output ratio itself(ky=ky)?
It may happen when I impose the time restriction or limit the group to be N>T, depending on the single cases, nonetheless.
Third and my biggest concern is, since I am using partialling-out exogenous regressors, if I'm running a problem of weak instrument here. My display table for the general model is:
Code:
IV (2SLS) estimation -------------------- Estimates efficient for homoskedasticity only Statistics robust to heteroskedasticity and clustering on ts and kernel-robust to common correlated disturbances (Driscoll-Kraay) kernel=Bartlett; bandwidth=1 time variable (t): ts group variable (i): id Number of clusters (ts) = 35 Number of obs = 828 F(124, 34) = 7.1e+05 Prob > F = 0.0000 Total (centered) SS = 3885.575361 Centered R2 = 0.9057 Total (uncentered) SS = 3885.575361 Uncentered R2 = 0.9057 Residual SS = 366.2462637 Root MSE = .7559
variables coefficient for abbreviation are omitted
and
Code:
Underidentification test (Kleibergen-Paap rk LM statistic): 0.029 Chi-sq(1) P-val = 0.8638 Weak identification test (Cragg-Donald Wald F statistic): 0.019 (Kleibergen-Paap rk Wald F statistic): 0.025 Stock-Yogo weak ID test critical values: 10% maximal IV size 16.38 15% maximal IV size 8.96 20% maximal IV size 6.66 25% maximal IV size 5.53 Source: Stock-Yogo (2005). Reproduced by permission. NB: Critical values are for Cragg-Donald F statistic and i.i.d. errors.
R2 is too high over 0.90, maybe a sing of pathology but when it comes to IV then I am told that R-square is not that important. Is that correct
I know should have provided data to reproduce the problem, but there are too many variables to be included.
I would appreciate if you could comment, and I would appreciate if you could kindly provide a code on how to proceed for the estimation
Fei Wang Joao Santos Silva Jeff WooldridgeEnable GingerCannot connect to Ginger Check your internet connection
or reload the browserDisable in this text fieldRephraseRephrase current sentence63Edit in Ginger×
0 Response to Number of observations must be greater than number of instruments and under- and weak identification test
Post a Comment