Dear STATA community,
Please help!
I have messy data containing over thirty variations of different words (e.g., leadership) in four columns for over 800 observations. A screenshot of the data is attached. I could not use the dataex command due to the large size of the file.
How do I quickly calculate how many times each of the words appear among all four columns?
I also need to do crosstabulations between these two words at a time. How would I do it?
Do I need to recode each word into a numeric value?
I would appreciate your help!
Olena
Related Posts with Messy string data: how to do crosstabulations and descriptives
Interpreting quadratic terms in a multiple linear regression.Hi everyone on Statalist, I am working on a project and have run into a few obstacles. The purpose…
GLLAMM error not convergeHi all, I have date of 100 employees per company, for several companies. I would like to perform a …
Using matrix as lookup tableHello, I have two datasets. I am trying to get a new variable in the second dataset, which looks up…
Weights calculation with metapropHello, I am conducting a meta-analysis of prevalence proportions using a random effects model with …
Generate sequencial dicotomic variableHi all, I am trying to alternate position label (pos=6 and pos=12) in a scatter graph, for this pur…
Subscribe to:
Post Comments (Atom)
0 Response to Messy string data: how to do crosstabulations and descriptives
Post a Comment