Dear Statalist community,
I am doing research using insurance claim data, where the dependent variable of interest is the loss-cost ratio, i.e., the indemnity amount divided by the total liability. By construction it is a fractional variable bounded on [0, 1]. However, it has an excess of zeros because of deductibles, and my understanding is that these zeros are effectively censored: an observed zero can correspond to a positive actual loss. In short, the dependent variable is a fractional response with censored zeros. I can think of several modeling approaches, but, if I understand them correctly, each one misses some aspect of the problem:
1. Fractional response model as in Papke and Wooldridge (1996): may not perform well when the share of zero observations is large, and it ignores the censoring at zero.
2. Two-limit Tobit: ignores the fractional nature of the variable and relies on strong distributional assumptions.
3. Zero-inflated beta model as in Cook et al. (2011): does not account for the censoring at zero.
4. Two-part fractional response model as in Ramalho and Ramalho (2011): for other reasons we want to analyze a balanced panel, but the second part of the two-part model is estimated only on the subsample of observations in (0,1), which makes the estimation sample unbalanced. Hence we prefer not to use it.
5. Fractional response model augmented with a heteroskedasticity specification, as in Wooldridge's slides (page 7): honestly, I don't understand why this works, and I'd appreciate it if anyone could explain. It also does not reflect the censoring at zero.
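A toy simulation can make the censoring concern behind the approaches above concrete. The sketch below is in Python rather than Stata, and everything in it is an assumption for illustration only: the latent loss share is drawn from a Beta(1, 3) distribution, and the hypothetical `deductible` is expressed as a share of liability and subtracted from the loss before indemnity is paid. The actual contract terms in the claim data may work differently.

```python
import random


def simulate_loss_cost_ratios(n=10_000, deductible=0.2, seed=42):
    """Return (latent loss share, observed loss-cost ratio) pairs.

    Assumptions (illustrative, not taken from the claim data):
    - the latent loss share follows Beta(1, 3) on (0, 1);
    - indemnity is paid only on the part of the loss above the
      deductible, so the observed ratio is max(latent - deductible, 0).
    """
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        latent = rng.betavariate(1.0, 3.0)
        observed = max(latent - deductible, 0.0)
        pairs.append((latent, observed))
    return pairs


pairs = simulate_loss_cost_ratios()

# Latent losses behind the observed zeros.
latent_behind_zeros = [latent for latent, observed in pairs if observed == 0.0]

# Share of observations recorded as exactly zero.
share_zero = len(latent_behind_zeros) / len(pairs)

# Among the observed zeros, the share that hides a positive actual loss.
share_zero_with_loss = (
    sum(1 for latent in latent_behind_zeros if latent > 0.0)
    / len(latent_behind_zeros)
)

print(f"share of observed zeros: {share_zero:.3f}")
print(f"share of zeros with a positive latent loss: {share_zero_with_loss:.3f}")
```

Under these assumptions a large fraction of observations is recorded as zero even though the underlying loss is positive, which is the sense in which the zeros are censored rather than true zeros.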
So my questions are:
(1) Why is approach 5 above able to account for excess zeros?
(2) What would be the best approach to model the dependent variable described above, i.e., a fractional variable with an excess of censored zeros, while still estimating on a balanced panel?
Also, if I have misunderstood anything, please feel free to point it out. Thanks!
Much appreciated,
Zhenni