BJ Data Tech Solution

Specializing in data processing, data management implementation plans, data collection tools (electronic and paper-based), data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solutions teaches the design and development of electronic data collection tools using CSPro, and Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

Cluster based on string similarity

Hey Community,

I'm quite new to Stata and would really appreciate some help. I have a dataset of more than 200 firms with various firm characteristics, including industry affiliation (see the example below). Each firm belongs to multiple industry groups, stored as a single comma-separated string. My goal is to cluster the firms by how similar their industry group affiliations are, and to create a new categorical variable that records which of 3 clusters each firm belongs to. Does anyone have experience with this kind of problem, or can anyone suggest a good way to approach it? Thank you so much in advance!

Data:

firm_id	industry_groups
1	Advertising, Commerce and Shopping, Sales and Marketing
2	Advertising, Media and Entertainment, Mobile, Sales and Marketing, Software
3	Energy, Natural Resources, Sustainability
...	...
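
To make the goal concrete, here is a minimal sketch of one possible approach, written in Python rather than Stata. Everything in it beyond the three example rows is an assumption for illustration: the helper names (`parse_groups`, `jaccard`, `cluster_firms`) are hypothetical, and so are the choices of Jaccard similarity between sets of industry groups and of average-linkage agglomerative clustering. It is not a tested recipe for the full dataset.

```python
from itertools import combinations

# Example rows copied from the post; the full dataset has more than 200 firms.
firms = {
    1: "Advertising, Commerce and Shopping, Sales and Marketing",
    2: "Advertising, Media and Entertainment, Mobile, Sales and Marketing, Software",
    3: "Energy, Natural Resources, Sustainability",
}


def parse_groups(text):
    """Split the comma-separated string into a set of trimmed group names."""
    return {part.strip() for part in text.split(",") if part.strip()}


def jaccard(a, b):
    """Jaccard similarity: shared groups divided by all distinct groups."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_firms(firms, k):
    """Average-linkage agglomerative clustering on Jaccard distance.

    Starts with one cluster per firm and repeatedly merges the two closest
    clusters until only k remain. Returns {firm_id: cluster_label}.
    """
    sets = {fid: parse_groups(text) for fid, text in firms.items()}
    clusters = [[fid] for fid in sorted(sets)]

    def avg_distance(c1, c2):
        # Mean of (1 - Jaccard similarity) over all cross-cluster firm pairs.
        total = sum(1.0 - jaccard(sets[i], sets[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        # combinations() yields index pairs with x < y, so deleting y is safe.
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: avg_distance(clusters[p[0]], clusters[p[1]]),
        )
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]

    return {
        fid: label
        for label, members in enumerate(clusters, start=1)
        for fid in members
    }


# With only three example rows, k=2 is used here; the post asks for 3 clusters.
labels = cluster_firms(firms, k=2)
print(labels)
```

On the three example rows with `k=2`, firms 1 and 2 share "Advertising" and "Sales and Marketing" and so end up in the same cluster, while firm 3 has no groups in common with either and stays on its own. For the real dataset the same call would use `k=3`, and the resulting labels could be merged back on `firm_id` as the new categorical variable.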
