Hi Stata Forum
I have an issue with a tobit regression predicting too high values.
As a part of me and my colleague’s master’s thesis, we are trying to use a Tobit regression to predict the proportion of Special Items (an accounting post) that can be considered as opportunistic.
Inspired by an American research paper we are using Compustat data for all public Nordic companies as the basis for our predictions. We have recreated the variables from the American paper and our coefficients resemble the original when running the tobit-regression.
The company observations are grouped by industry and year, and we have created the IndYear variable accounting for this - IndYear are dropped if they have less than 30 observations.
We run a forvalues loop for each IndYear and try to make IndYear specific predictions, but the predicted SI (PredSI) is higher than the actual SI 75 pct. of the time - for the American paper it predicted too high 7 pct. of the time.
The model should predict the “real” SI and the residual represents the opportunistic part, so the residuals can’t be negative.
We are using the following command for the regression-loop:
gen PredSI = .
forvalues i = 1/62 {
tempvar test
tobit SI $cntrl if IndYear ==`i', vce(cluster CompanyName)
predict `test', ystar(0,.5)
replace PredSI = `test' if IndYear == `i'
}
PredSI = Predicted Special Items
SI = Special Items
$cntrl holds 14 variables
All variables have been winsorized by IndYear using winsor2 at a (1 99) or (0 99) if they have a lower boundary.
Hope somebody can help!
Best regards
Mathias
Related Posts with Issue with a Looped Tobit Regression Predicting too high values
Robust standard errors with alternative applicationDear all, I wonder if anyone knows about an application or user written program that can be used to …
Convert Variable Values to Separate VariableHello, I am part of a research team analyzing survey data regarding attitudes towards a specific he…
Compare two coefficients in two different regressionsDear all: I would like to compare the coefficients generated by two different regressions (in two s…
Loop/Local Macro Not WorkingHello, Recently I've been having a lot of trouble with local macros, especially when using them for…
Presence of Heteroskedasticity, Autocorrelation and Cross-sectional Dependence/Contemporaneous Correlation - Panel DataI have a quarterly balanced panel data with 8 independent variables from 2000q1:2012:q2 (n=20 and T=…
Subscribe to:
Post Comments (Atom)
0 Response to Issue with a Looped Tobit Regression Predicting too high values
Post a Comment