BJ Data Tech Solution

BJ Data Tech Solution specializes in data processing, data management implementation plans, electronic and paper-based data collection tools, data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. It teaches the design and development of electronic data collection tools using CSPro, and STATA commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

Dealing with zeros

I have lots of zeros in both my dependent and independent variables.

One way I have been dealing with this is by adding 1 to all of the values. However, this makes each of the variables right-skewed, so I took the natural log to create a normal distribution. When I do this, I get a spike on the left followed by a normal distribution (see an example below). Because I believe this violates the assumption of normality, I tried dropping the zeros, but that reduces the sample size too much and I no longer get significance in my models. I read that I could impute the zero values with the mean, but I know that would misrepresent my data. I also read that I could take the square root instead of the log, but the data are still right-skewed rather than normally distributed. Any other thoughts on how I might deal with this issue would be much appreciated!
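
The spike described above can be reproduced with a small sketch in Python. This is only an illustration under stated assumptions: the data are simulated (40% exact zeros plus right-skewed positive values), not the dataset from the question, and the helper names `zero_share`, `log1p_transform`, and `sqrt_transform` are hypothetical. It shows that both log(x + 1) and the square root send every zero to exactly zero, so the share of observations piled up at the left edge is the same before and after the transformation.

```python
import math
import random


def zero_share(values):
    """Return the fraction of values that are exactly zero."""
    return sum(1 for v in values if v == 0) / len(values)


def log1p_transform(values):
    """Apply log(x + 1) to each value; zero maps to zero."""
    return [math.log1p(v) for v in values]


def sqrt_transform(values):
    """Apply the square root to each value; zero maps to zero."""
    return [math.sqrt(v) for v in values]


# Simulated data (an assumption for illustration only):
# about 40% exact zeros, the rest drawn from a right-skewed lognormal.
random.seed(0)
raw = [
    0.0 if random.random() < 0.4 else random.lognormvariate(1.0, 0.8)
    for _ in range(1000)
]

logged = log1p_transform(raw)
rooted = sqrt_transform(raw)

# The share of zeros is identical in all three versions of the variable,
# which is why the spike on the left survives either transformation.
print("share of zeros, raw:     ", zero_share(raw))
print("share of zeros, log(x+1):", zero_share(logged))
print("share of zeros, sqrt(x): ", zero_share(rooted))
```

In this sketch the transformations reshape only the positive part of the distribution; the zeros stay stacked at a single point, which matches the pattern reported in the question.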
