Hi All,

I'm trying to analyse survey data, but the data come in 35 different Stata files. The problem is that I only need a few variables from each file.

I've tried merging the files, but the result wasn't meaningful: the variables differ from file to file, so I was left with many empty cells after merging.

Please, how can I extract only the specific variables I need for my analysis (up to 12 variables from 8 different files) without losing the survey's core characteristics? I learnt that, for a survey analysis to produce reliable variance estimates, the full sample should be used in the analysis.

Below are two of the Stata files, each with one of the variables I'd like to extract for use in a single analysis. For instance, I expect college graduates to be better informed about looking after their health, so that they are less likely to be physically disabled.

Thank you for your time.

sect2_education.dta
What is the highest educational level [NAME] completed?

sect3_health.dta
Did [NAME] have to stop his/her usual activities because of this [ILLNESS/INJURY]
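
To make concrete what I am after, here is a rough sketch of the result I have in mind. It is plain Python rather than Stata, with made-up rows standing in for the two files. The names hhid, pid, educ_level, stopped_activities, school_type and visited_clinic are hypothetical, and I am assuming the files share household and person identifiers, which I have not confirmed. The idea is to keep only the variables I need from each file and attach them to the full list of respondents, so that nobody is dropped even when they have no record in the second file.

```python
# Hypothetical stand-ins for rows read from sect2_education.dta and
# sect3_health.dta. All column names and values here are made up.
education = [
    {"hhid": 1, "pid": 1, "educ_level": "college", "school_type": "public"},
    {"hhid": 1, "pid": 2, "educ_level": "primary", "school_type": "private"},
    {"hhid": 2, "pid": 1, "educ_level": "secondary", "school_type": "public"},
]
health = [
    {"hhid": 1, "pid": 1, "stopped_activities": "no", "visited_clinic": "yes"},
    {"hhid": 2, "pid": 1, "stopped_activities": "yes", "visited_clinic": "no"},
]

# Assumed identifiers shared by every file (household ID and person ID).
KEYS = ("hhid", "pid")


def keep_vars(rows, variables):
    """Keep only the ID columns plus the requested variables."""
    wanted = KEYS + tuple(variables)
    return [{name: row[name] for name in wanted} for row in rows]


def left_join(master, other, variables):
    """Attach variables from other to every master row.

    Every master row is kept; rows with no match in other get None.
    """
    lookup = {tuple(row[k] for k in KEYS): row for row in other}
    merged = []
    for row in master:
        match = lookup.get(tuple(row[k] for k in KEYS), {})
        merged.append({**row, **{v: match.get(v) for v in variables}})
    return merged


combined = left_join(
    keep_vars(education, ["educ_level"]),
    keep_vars(health, ["stopped_activities"]),
    ["stopped_activities"],
)

for row in combined:
    print(row)
```

In this sketch the combined data still has one row per person in the education file, and the person with no health record is kept with a missing value instead of being dropped. I would like to know how to get the equivalent in Stata.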