Hi everyone,
I have conducted an average linkage, hierarchical cluster analysis using the Sneath and Sokall similarity coefficent as all my variables are binary (present=1, absent=0), but co-absence shouldn't weigh as much as co-presence in the clustering. Now I have found that the stopping rules in cluster analysis supported by Stata are the Calinski–Harabasz pseudo-F and the The Duda–Hart Je(2)/Je(1) index. However, both of these are for continous data.
Is there any way I could for instance use an adaptation of the Goodman ad Kruskal's gamma statistic for categorical data or something else like it in Stata?
FYI: I have nearly copy-pastet this post https://www.statalist.org/forums/for...on-binary-data as the problem described there is nearly the same as mine, however, no solution is provided. I am hoping a solution has been found since 2017.
Related Posts with Stopping rules in cluster analysis on binary data
heatplot assistanceHello dear statalist members, I'm working with the heatplot command and I have an issue with the lo…
Is there any way to create age and education variables of a father?Hi all, I am working with the Multiple Indicator Cluster Survey (MICS) whose research design is sim…
selection Problem out of huge amount of occupationsDear Stata Community, Iam almost new to stata and Iam kind of stuck with a problem. May someone can…
Not duplicating Mata code / Using STATA in MataI have a function which is straightforward to implement in both the ado scripting language and mata.…
loop over files in a folder to do an operation and save the combined result as datasetHello everyone!! I should specify, I am new to stata, so if i misuse terminology, please forgive me…
Subscribe to:
Post Comments (Atom)
0 Response to Stopping rules in cluster analysis on binary data
Post a Comment