BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

Data cleaning - checking correct encoding of variables
Data cleaning - checking correct encoding of variables

Hello,

I'm very new to Stata and am trying to complete some data cleaning. I have a dataset with 5 variables and around 200 million observations. The variables are all numeric, and I would like to check that three of them have been encoded correctly, as they were originally categorical (string) variables. For example, I would like to know if the numerical code captures distinct countries for the country variable (there may be typos in the original categories, for instance).

The original string variables are not available, but Stata shows the country names in browse (the categorical variable), but treats the variable as numeric in the data editor. Is there any way to check what the equivalencies between the two are?

Thank you in advance for any help you might be able to give me!

Best wishes,
Clara

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Data cleaning - checking correct encoding of variables
Data cleaning - checking correct encoding of variables

0 Response to Data cleaning - checking correct encoding of variables

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Data cleaning - checking correct encoding of variables Data cleaning - checking correct encoding of variables

0 Response to Data cleaning - checking correct encoding of variables