BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

How to check correlation among 2000+ variables in a quick way?
How to check correlation among 2000+ variables in a quick way?

Hi, all,

I had a data set of 2000+ variables (all binary or categorical), for example, 1000+ are characteristics of people (such as nationality, gender, etc.), the other 1000+ are fruits and vegetables (with 1 indicate the person likes it, and 0 indicates the person does not like). For example, in the picture I attached, we can see 1. most Columbians love Jackfruit; 2. most female like watermelon and pineapple; 3. most German female like apple...etc.

Is there a way (for example, maybe there is a code that STATA could automatically run many regressions and tell me which two variables are strongly correlated) that I can check correlations among those 2000+ variables quickly?

Thank you very much! I am totally lost......

Array

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / How to check correlation among 2000+ variables in a quick way?
How to check correlation among 2000+ variables in a quick way?

0 Response to How to check correlation among 2000+ variables in a quick way?

Post a Comment

Home / Data Cleaning / Data management / Data Processing / How to check correlation among 2000+ variables in a quick way? How to check correlation among 2000+ variables in a quick way?

0 Response to How to check correlation among 2000+ variables in a quick way?