I want to create a code giving me summary statistics (mean, min and max) of each subgroup of a single categorical variable.
Example: I want to display mean age, min and max within each age group. The categorical variable is called agegroup, and looks like this:
Code:
codebook agegroup ---------------------------------------------------------------------------------------- agegroup Age strata within the cohort ---------------------------------------------------------------------------------------- type: numeric (float) label: agegroup_lbl range: [1,6] units: 1 unique values: 6 missing .: 0/9,163 tabulation: Freq. Numeric Label 1,113 1 20.0-24.9 1,733 2 25.0-29.9 1,303 3 30.0-34.9 1,504 4 35.0-39.9 1,742 5 40.0-44.9 1,768 6 45.0-49.9
Code:
. codebook PartAge ---------------------------------------------------------------------------------------- PartAge (unlabeled) ---------------------------------------------------------------------------------------- type: numeric (float) range: [20,49.9] units: .1 unique values: 300 missing .: 0/9,163 mean: 35.9571 std. dev: 8.55075 percentiles: 10% 25% 50% 75% 90% 24.3 28.4 36.5 43.4 47.4
Code:
egen agegroup_mean = mean(PartAge), by(agegroup)
Sigrid
0 Response to Summary statistics for subgroups of a categorical variable
Post a Comment