Hi, all,
I had a data set of 2000+ variables (all binary or categorical), for example, 1000+ are characteristics of people (such as nationality, gender, etc.), the other 1000+ are fruits and vegetables (with 1 indicate the person likes it, and 0 indicates the person does not like). For example, in the picture I attached, we can see 1. most Columbians love Jackfruit; 2. most female like watermelon and pineapple; 3. most German female like apple...etc.
Is there a way (for example, maybe there is a code that STATA could automatically run many regressions and tell me which two variables are strongly correlated) that I can check correlations among those 2000+ variables quickly?
Thank you very much! I am totally lost......
Array
Related Posts with How to check correlation among 2000+ variables in a quick way?
How to know whether the paralell trend assumption is satisfied by using the method of Borusyak, (2021) in DiD ?Hi all, I am following the method of Borusyak, (2021) by running the Code: did_imputation package. …
how to merge data using different key variable names?I want to merge data between variable1 in dataset1 and variable2, variable3, variable4 in dataset2 r…
How to compute constant on fixed effects Quantile regressionHi, I know that xtqreg command does not show any constant, however, as i am working on a genetics t…
How to know whether laws affect dependent variables and paralell trend being satisfied following Chaisemartin and D'Haultfoeuille ?I am using a package called Code: did_multiplegt to run following Chaisemartin and D'Haultfoeuille …
High R-square in ppmlDear Statalist community, I would like to clarify few things about using ppml on gravity data. 1. …
Subscribe to:
Post Comments (Atom)
0 Response to How to check correlation among 2000+ variables in a quick way?
Post a Comment