Dear Stata community,
Please help!
I have messy data containing over thirty variations of different words (e.g., leadership) in four columns for over 800 observations. A screenshot of the data is attached. I could not use the dataex command due to the large size of the file.
How do I quickly calculate how many times each of the words appears across all four columns?
I also need to cross-tabulate these words two at a time. How would I do that?
Do I need to recode each word into a numeric value?
I would appreciate your help!
Olena
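One possible approach, sketched here under assumptions rather than offered as a definitive answer: the post does not give variable names, so the code below assumes the four columns are string variables called word1 to word4 (hypothetical names; rename yours or change the stub accordingly), and "teamwork" is a made-up second word used only for illustration. Recoding to numeric values should not be needed for simple counts, because tabulate accepts string variables. The idea is to stack the four columns into one with reshape long, tabulate that single column, and then build 0/1 indicators per observation for the cross-tabulation.

```stata
* Assumed (hypothetical) names: string variables word1-word4.
* preserve/restore leaves the original wide data untouched.
preserve

* One row per observation-by-column cell.
generate long obs_id = _n
reshape long word, i(obs_id) j(slot)

* Light cleaning so that "Leadership " and "leadership" count as one word.
replace word = lower(strtrim(word))

* Frequency of each word across all four columns, most common first.
* Empty strings are treated as missing and left out of the table.
tabulate word, sort

* Cross-tabulate two words at a time: flag each word, then collapse
* back to one row per observation.
generate byte has_leadership = (word == "leadership")
generate byte has_teamwork   = (word == "teamwork")   // made-up example word
collapse (max) has_leadership has_teamwork, by(obs_id)
tabulate has_leadership has_teamwork

restore
```

If the thirty-odd variations include different spellings of the same word, they would need to be harmonized (for example with further replace statements) before the tabulate step, otherwise each spelling is counted separately.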