Hi everyone,
I have conducted an average linkage, hierarchical cluster analysis using the Sneath and Sokall similarity coefficent as all my variables are binary (present=1, absent=0), but co-absence shouldn't weigh as much as co-presence in the clustering. Now I have found that the stopping rules in cluster analysis supported by Stata are the Calinski–Harabasz pseudo-F and the The Duda–Hart Je(2)/Je(1) index. However, both of these are for continous data.
Is there any way I could for instance use an adaptation of the Goodman ad Kruskal's gamma statistic for categorical data or something else like it in Stata?
FYI: I have nearly copy-pastet this post https://www.statalist.org/forums/for...on-binary-data as the problem described there is nearly the same as mine, however, no solution is provided. I am hoping a solution has been found since 2017.
Related Posts with Stopping rules in cluster analysis on binary data
Constant term omitted without specifying "nocons", helpI ran a logit regression and the constant term was missing. How can this be? xtlogit vigact3 years …
How to test heteroskedasticity with xtpmg command. ThanksDear all, I used xtpmg command to estimate three of following: PMG (Pooled Mean Group) MG (Mean Gro…
Reporting contrasts of marginal effects from logit/probit modelsLet's say I have a model to predict the probability of an outcome, and I am interested in reporting …
Fixed Effects at individual-area-level?Hello folks! I have read this paper today: http://ftp.iza.org/dp9311.pdf If you scroll to page 11,…
Merging datasets togetherI am currently in a tricky situation. I have a dataset with 83,311 observations and 109 variables. I…
Subscribe to:
Post Comments (Atom)
0 Response to Stopping rules in cluster analysis on binary data
Post a Comment