Hi. I just joined Statalist, and this is my first question as a beginner.
I am worried that this duplicates an earlier question, but my situation is specific, so I am asking anyway. I would appreciate your understanding.
I am analyzing risk factors for the development of proteinuria.
Sample size is 319,457.
To summarize and compare variables across categories (BMI category or presence of proteinuria),
I need to know whether each variable follows a normal distribution.
(I plan to use an independent t-test or ANOVA for normally distributed continuous variables, and the Kruskal-Wallis test for non-normally distributed continuous variables.)
So I ran sktest to test for normality.
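A minimal sketch of the kind of call that produces a table like the one below, assuming the variables are named exactly as they appear in the output. The second line is an optional variant that is not part of the original steps.

```stata
* Assumes the four variables are named as in the output table
sktest wbc bmi sbp height

* Optional variant: same test without the small-sample adjustment
* to the joint chi-squared statistic
sktest wbc bmi sbp height, noadjust
```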
                    Skewness/Kurtosis tests for Normality
                                                          ------ joint ------
    Variable |        Obs  Pr(Skewness)  Pr(Kurtosis) adj chi2(2)   Prob>chi2
-------------+---------------------------------------------------------------
         wbc |    317,927     0.0000        0.0000            .          .
         bmi |    317,927     0.0000        0.0000            .          .
         sbp |    317,927     0.0000        0.0000            .          .
      height |    317,927     0.0000        0.0000            .     0.0000
Q1) Is it correct to interpret this result as the variables not following a normal distribution?
Q2) Why do wbc, bmi, and sbp report no value for Prob>chi2, while height reports 0.0000?
When I ran summarize, detail for height, its skewness was 0.05 and its kurtosis was 3.21.
And when I drew a histogram, it looked like the one below.
[Histogram of height; image not available]
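A minimal sketch of how the summary statistics and histogram for height could be produced, assuming the variable is named height as in the sktest output. The normal option and the qnorm plot are additions for visual comparison against a normal distribution, not part of the original steps.

```stata
* Skewness and kurtosis appear in the detailed summary
summarize height, detail

* Histogram with a normal density overlaid for visual comparison
histogram height, normal

* Quantile-normal plot: points close to the diagonal suggest approximate normality
qnorm height
```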
Q3) If I am not being strict, can I treat this variable as normally distributed?
I suspect the sample size is so large that sktest rejects normality even for very small departures.
Q4) Is there a normality test that is appropriate for such a large sample?
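To illustrate why a sample this large can reject normality for small departures, here is a rough back-of-the-envelope check. It uses the usual large-sample approximations for the standard errors of skewness, sqrt(6/n), and kurtosis, sqrt(24/n), under normality. This is an assumption made for illustration and not the transformation sktest itself applies.

```stata
* Rough large-sample z-scores under normality (approximation only,
* not the statistic that sktest computes)
local n = 317927

* Reported skewness of height was 0.05
display "approx. z for skewness: " 0.05 / sqrt(6 / `n')

* Reported kurtosis of height was 3.21, i.e. excess kurtosis 0.21
display "approx. z for kurtosis: " (3.21 - 3) / sqrt(24 / `n')
```

With n = 317,927 both ratios come out far above 2 (roughly 11.5 and 24), so even these visually small departures are highly significant.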
And lastly...
I learned about the Central Limit Theorem (as I understand it, when the sample size is large enough, we can assume the data follow a normal distribution).
Q5) Can I apply this theorem to my analysis? That is, can I report means instead of medians and use ANOVA instead of the Kruskal-Wallis test?
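For concreteness, the two approaches being compared could be run side by side along these lines. This is only a sketch: bmicat and proteinuria are hypothetical names for the BMI-category and proteinuria indicator variables, which are not named above.

```stata
* bmicat and proteinuria are hypothetical names for the grouping variables

* Three or more groups: one-way ANOVA versus Kruskal-Wallis
oneway height bmicat, tabulate
kwallis height, by(bmicat)

* Two groups: independent t-test versus Wilcoxon rank-sum
ttest height, by(proteinuria)
ranksum height, by(proteinuria)

* Means and medians by group, to see how much the two summaries differ
tabstat height, by(bmicat) statistics(mean sd median iqr n)
```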