Hi Stata Forum
I have an issue with a tobit regression predicting too high values.
As a part of me and my colleague’s master’s thesis, we are trying to use a Tobit regression to predict the proportion of Special Items (an accounting post) that can be considered as opportunistic.
Inspired by an American research paper we are using Compustat data for all public Nordic companies as the basis for our predictions. We have recreated the variables from the American paper and our coefficients resemble the original when running the tobit-regression.
The company observations are grouped by industry and year, and we have created the IndYear variable accounting for this - IndYear are dropped if they have less than 30 observations.
We run a forvalues loop for each IndYear and try to make IndYear specific predictions, but the predicted SI (PredSI) is higher than the actual SI 75 pct. of the time - for the American paper it predicted too high 7 pct. of the time.
The model should predict the “real” SI and the residual represents the opportunistic part, so the residuals can’t be negative.
We are using the following command for the regression-loop:
gen PredSI = .
forvalues i = 1/62 {
tempvar test
tobit SI $cntrl if IndYear ==`i', vce(cluster CompanyName)
predict `test', ystar(0,.5)
replace PredSI = `test' if IndYear == `i'
}
PredSI = Predicted Special Items
SI = Special Items
$cntrl holds 14 variables
All variables have been winsorized by IndYear using winsor2 at a (1 99) or (0 99) if they have a lower boundary.
Hope somebody can help!
Best regards
Mathias
Related Posts with Issue with a Looped Tobit Regression Predicting too high values
Interaction effect in probit/logit significant?Dear Listers, My dependent variable is binary, so is one of my two independent variables. The other…
Compare permute to ttestDear all, maybe this is a silly question but I cannot spot the error. Basically, I can compare 2 sam…
How to interpret PCA outcome variables?Hi all, I have a question with regards to principal component analysis. On the basis of pca, I crea…
Endogeneity check in Panel data (FE/RE)Dear all, I am working with Panel data (FE and RE models) and performing several diagnostic tests a…
forest plots for sensitivity specificityHi, I am practicing meta-analysis of diagnostic accuracy studies using the midas software on stata u…
Subscribe to:
Post Comments (Atom)
0 Response to Issue with a Looped Tobit Regression Predicting too high values
Post a Comment