BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

Stopping rules in cluster analysis on binary data
Stopping rules in cluster analysis on binary data

Hi everyone,

I have conducted an average linkage, hierarchical cluster analysis using the Sneath and Sokall similarity coefficent as all my variables are binary (present=1, absent=0), but co-absence shouldn't weigh as much as co-presence in the clustering. Now I have found that the stopping rules in cluster analysis supported by Stata are the Calinski–Harabasz pseudo-F and the The Duda–Hart Je(2)/Je(1) index. However, both of these are for continous data.

Is there any way I could for instance use an adaptation of the Goodman ad Kruskal's gamma statistic for categorical data or something else like it in Stata?

FYI: I have nearly copy-pastet this post https://www.statalist.org/forums/for...on-binary-data as the problem described there is nearly the same as mine, however, no solution is provided. I am hoping a solution has been found since 2017.

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Stopping rules in cluster analysis on binary data
Stopping rules in cluster analysis on binary data

0 Response to Stopping rules in cluster analysis on binary data

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Stopping rules in cluster analysis on binary data Stopping rules in cluster analysis on binary data

0 Response to Stopping rules in cluster analysis on binary data