Hi everyone,
I have run an average-linkage hierarchical cluster analysis using the Sneath and Sokal similarity coefficient. I chose it because all my variables are binary (present = 1, absent = 0) and co-absence shouldn't weigh as much as co-presence in the clustering. I have now found that the stopping rules for cluster analysis supported by Stata are the Calinski–Harabasz pseudo-F and the Duda–Hart Je(2)/Je(1) index. However, both of these are for continuous data.
Is there any way I could, for instance, use an adaptation of Goodman and Kruskal's gamma statistic for categorical data, or something else like it, in Stata?
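To make concrete what I mean by a gamma-type stopping rule, here is a minimal sketch in Python (not Stata, and not an existing Stata command). It assumes the Sneath and Sokal coefficient is a / (a + 2(b + c)), where a counts co-presences and b + c counts mismatches, so co-absences are ignored. It then compares every within-cluster dissimilarity with every between-cluster dissimilarity, which is the adaptation of Goodman and Kruskal's gamma usually attributed to Baker and Hubert. The function names and the toy data are hypothetical, and treating two all-absent observations as identical is my own assumption.

```python
from itertools import combinations


def sneath_sokal_dissimilarity(x, y):
    """1 - a / (a + 2(b + c)) for two binary vectors; co-absences are ignored."""
    a = sum(1 for p, q in zip(x, y) if p == 1 and q == 1)  # co-presences
    mismatches = sum(1 for p, q in zip(x, y) if p != q)    # b + c
    denom = a + 2 * mismatches
    if denom == 0:
        # Both vectors are all zeros: treated as identical (an assumption).
        return 0.0
    return 1.0 - a / denom


def gamma_index(data, labels):
    """Gamma-type index for one cluster solution.

    Counts pairs (within-cluster dissimilarity, between-cluster dissimilarity)
    that are concordant (within < between) or discordant (within > between);
    ties are ignored.
    """
    within, between = [], []
    for i, j in combinations(range(len(data)), 2):
        d = sneath_sokal_dissimilarity(data[i], data[j])
        (within if labels[i] == labels[j] else between).append(d)
    concordant = sum(1 for w in within for b in between if w < b)
    discordant = sum(1 for w in within for b in between if w > b)
    if concordant + discordant == 0:
        return float("nan")  # undefined, e.g. a single cluster or all ties
    return (concordant - discordant) / (concordant + discordant)


# Hypothetical toy data: two obvious groups of presence/absence profiles.
data = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 0, 1, 1],
]
good = gamma_index(data, [0, 0, 1, 1])  # matches the obvious grouping
bad = gamma_index(data, [0, 1, 0, 1])   # splits each obvious group
```

The idea would be to compute this index for each candidate number of clusters cut from the dendrogram and prefer the solution with the highest value. The pairwise comparison is quadratic in the number of dissimilarities, so this sketch is only practical for small samples.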
FYI: I have nearly copy-pasted this post (https://www.statalist.org/forums/for...on-binary-data) because the problem described there is nearly the same as mine; however, no solution is provided there. I am hoping a solution has been found since 2017.