I have lots of zeros in both my dependent and independent variables.
One way that I was dealing with this is by adding 1 to all of the values. However, this makes each of the variables right-skewed. So I took the natural log to create a normal distribution. But when I do this I get a spike to the left followed by a normal distribution (see an example below). As I believe this violates the assumption of normal distribution, I tried dropping the zeros which reduces the sample size too much and then I don't get significance in my models. I read that I could impute the zero values with the mean, but I know that would misrepresent my data. I also read that I could take the square root instead of the log for transformation, but the data is still right skewed rather than having a normal distraction. Any other thoughts on how I might deal with this issue would be much appreciated!
Related Posts with Dealing with zeros
Ceo pay sliceDear Stata Community, I need your guidance in calculating the CEO PAY SLICE (CPS) proposed by Bebchu…
Interpretation Kleibergen-Paap, Cragg-Donald and Stock-Yogo weak IDDear users, for my thesis I'm working with an IV regression, where I try to see what effect stock o…
Question:Hi all, As part of my master's progam in Accountancy, I am doing a replication study. I was wonderi…
CPS monthly data coding suggestion for matching indivudual over timeHello stata community, I have been working with CPS ASEC data for over a couple of months but now I…
Reversion mean after overreactionsI'm studying overreactions in the BVSP index and I need to check if after these reactions the return…
Subscribe to:
Post Comments (Atom)
0 Response to Dealing with zeros
Post a Comment