Hi everyone,
I have conducted an average linkage, hierarchical cluster analysis using the Sneath and Sokall similarity coefficent as all my variables are binary (present=1, absent=0), but co-absence shouldn't weigh as much as co-presence in the clustering. Now I have found that the stopping rules in cluster analysis supported by Stata are the Calinski–Harabasz pseudo-F and the The Duda–Hart Je(2)/Je(1) index. However, both of these are for continous data.
Is there any way I could for instance use an adaptation of the Goodman ad Kruskal's gamma statistic for categorical data or something else like it in Stata?
FYI: I have nearly copy-pastet this post https://www.statalist.org/forums/for...on-binary-data as the problem described there is nearly the same as mine, however, no solution is provided. I am hoping a solution has been found since 2017.
Related Posts with Stopping rules in cluster analysis on binary data
Unrealistic R squared in LSDV modelHello everybody, Situation: I use Stata 14.2. I want to investigate the effects of mobile phone pen…
Stata Regression YearsHi all, I looked at a sample of companies and their balance sheets from 2014 to 2017. To do this, I…
100% sensitivity in my estat test, no data classified as negative & I'm only seeing 16/1771 case on my roc print out and scatterplotGood afternoon, When I run my logistic regression, everything comes out OK with no strange data. Ho…
How to run independence test on 3 categorical variables on Stata?I'm working on descriptive analysis and wish to compare 3 categorical variables: Sex (man / woman) …
Interpreting regression coefficients with min-max scalingHi, I am running a regression with indices. Dependent variable is overall index while independent v…
Subscribe to:
Post Comments (Atom)
0 Response to Stopping rules in cluster analysis on binary data
Post a Comment