I have lots of zeros in both my dependent and independent variables.
One way that I was dealing with this is by adding 1 to all of the values. However, this makes each of the variables right-skewed. So I took the natural log to create a normal distribution. But when I do this I get a spike to the left followed by a normal distribution (see an example below). As I believe this violates the assumption of normal distribution, I tried dropping the zeros which reduces the sample size too much and then I don't get significance in my models. I read that I could impute the zero values with the mean, but I know that would misrepresent my data. I also read that I could take the square root instead of the log for transformation, but the data is still right skewed rather than having a normal distraction. Any other thoughts on how I might deal with this issue would be much appreciated!
Related Posts with Dealing with zeros
Compare the unexplained wage differential of the Blinder-Oaxaca decomposition between two groupsDear all, after using the Blinder-Oaxaca decompostion to decompose the African American/ White wage …
bidensity assigns too many 0Hello, I am trying to acquire a bivariate density contour plot of fathers' and their sons' income. I…
combining two bar graphsHi, I am wanting to compare the mean of an outcome amongst male and female children in households wi…
GLM Fracreg or Count modelHello, I have received contradictory suggestions on how to analyze certain data and would love to ge…
Exponentiated form with mi estimate and xtgee: no effect when using eformHello Statalist, I am using Stata 16 and I am trying to obtain the exponentiated form of my estimat…
Subscribe to:
Post Comments (Atom)
0 Response to Dealing with zeros
Post a Comment