Hi, all,



I have a data set of 2000+ variables, all binary or categorical. 1000+ of them are characteristics of people (such as nationality, gender, etc.), and the other 1000+ are fruits and vegetables (1 indicates that the person likes the item, 0 that the person does not). For example, in the picture I attached, we can see that 1. most Colombians love jackfruit; 2. most females like watermelon and pineapple; 3. most German females like apples, etc.

Is there a way to check correlations among those 2000+ variables quickly? For example, is there code that would make Stata automatically run many regressions and tell me which pairs of variables are strongly correlated?
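To make the request concrete, here is a minimal sketch of the kind of output being asked for: every pair of variables is compared, and the pairs are ranked by the absolute value of their Pearson correlation (for two 0/1 variables this equals the phi coefficient). It is written in Python rather than Stata, purely as an illustration. The function names `pearson` and `strongest_pairs`, the variable names, and the toy data are all assumptions made up for this example, and categorical variables are assumed to have been expanded into 0/1 indicators first.

```python
from itertools import combinations
from math import sqrt
import random


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists (None if a list is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None  # correlation is undefined for a constant variable
    return sxy / sqrt(sxx * syy)


def strongest_pairs(data, top=10):
    """Return the `top` (var_a, var_b, r) triples with the largest |r|.

    `data` maps a variable name to its list of 0/1 values (hypothetical layout).
    """
    results = []
    for a, b in combinations(data, 2):  # each unordered pair exactly once
        r = pearson(data[a], data[b])
        if r is not None:
            results.append((a, b, r))
    results.sort(key=lambda t: abs(t[2]), reverse=True)
    return results[:top]


# Toy data, invented for illustration only: watermelon preference is built
# to agree with `female` about 90% of the time; the rest is random noise.
rng = random.Random(0)
female = [rng.randint(0, 1) for _ in range(500)]
toy = {
    "female": female,
    "german": [rng.randint(0, 1) for _ in range(500)],
    "likes_watermelon": [f if rng.random() < 0.9 else 1 - f for f in female],
    "likes_apple": [rng.randint(0, 1) for _ in range(500)],
}

for a, b, r in strongest_pairs(toy, top=3):
    print(f"{a:>18} ~ {b:<18} r = {r:+.2f}")
```

With 2000+ variables there are roughly two million pairs, so a plain loop like this would be slow on a large sample; the sketch only shows the shape of the result (a ranked list of variable pairs), not a recommended implementation.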

Thank you very much! I am totally lost......


