Aggregating Data/ Loop

Hello All,

I have a longitudinal panel data set with 633 constituency codes (_pcon) ranging from 0 to 632. There are 8 'waves' (ranging 1-8) for each constituency, each representing a different time that the data was collected. I have a binary indicator if an individual at each occasion is unemployed or not. There are 278,417 observations in total, each with a corresponding _pcon and (multiple) wave. I would like to aggregate my unemployment data to calculate the mean unemployment rate for each constituency in each wave, i.e. a percentage of unemployment for each constituency 0 to 632 and each wave 1 to 8 in my sample, resulting in 5064 mean values.

So far I been doing this:

Code:

. summ unemployed if _pcon==0 & wave ==1

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         80       .0875    .2843491          0          1


. summ unemployed if _pcon==0 & wave ==2

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         66    .0909091    .2896827          0          1


. summ unemployed if _pcon==0 & wave ==3

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         61    .0983607    .3002731          0          1


. summ unemployed if _pcon==0 & wave ==4

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         54    .0555556    .2312123          0          1


. summ unemployed if _pcon==0 & wave ==5

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         58    .0172414    .1313064          0          1


. summ unemployed if _pcon==0 & wave ==6

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         86    .1046512    .3078988          0          1


. summ unemployed if _pcon==0 & wave ==7

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         75    .0933333    .2928579          0          1


. summ unemployed if _pcon==0 & wave ==8

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
  unemployed |         74    .1756757    .3831416          0          1

I have then been manually generating the unemployment percent via the Mean*100. I realise that this method is extremely inefficient. I have been trying to research a better way of doing this however am new to STATA and have been unsuccessful.

Moreover, to merge with my existing data file in another software, if possible I would require a mean unemployment for each of my 228, 417 observations so that I can copy and paste the mean aggregate unemployment for each observation.

I was hoping there was some way (maybe a loop) of doing it in Stata?

I hope that I have made myself clear. Many thanks in advance for any advice/help, Ellie.

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Aggregating Data/ Loop
Aggregating Data/ Loop

0 Response to Aggregating Data/ Loop

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Aggregating Data/ Loop Aggregating Data/ Loop

Related Posts with Aggregating Data/ Loop

0 Response to Aggregating Data/ Loop

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Aggregating Data/ Loop
Aggregating Data/ Loop