Hi all,

Apologies in advance, I'm an inexperienced undergrad.

I have a dataset with one household identifier, hhid; two profit variables, profit_bl (profit before an intervention) and profit_el (profit after the intervention); and one binary variable, followup, which indicates whether the observation was recorded before or after the intervention.

Since the values of profit_el and profit_bl are the same for every member of a household, I want to reduce the data to one observation per household, so that the baseline and endline profit values belong to the same observation. Previous threads have suggested using tag() with hhid, but it seems a little more complex than that.
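To make the goal concrete, here is a minimal sketch on made-up numbers (not my real data). I am assuming that profit_bl is only filled in on baseline rows and profit_el only on endline rows, which may not match the actual layout, and collapse with (firstnm) is only my guess at one possible approach, not something taken from the earlier threads:

```stata
* Toy data, invented purely for illustration (not the real dataset)
clear
input hhid followup profit_bl profit_el
1 0 100   .
1 0 100   .
1 1   . 150
1 1   . 150
2 0  80   .
2 1   .  60
end

* Keep one row per household; (firstnm) takes the first
* non-missing value of each variable within each hhid
collapse (firstnm) profit_bl profit_el, by(hhid)

list
```

The result I am after is one row per hhid holding both profit_bl and profit_el. Note that collapse drops every variable not listed, including followup, so this may be too blunt if other variables need to be kept.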

Here is a screenshot of the data in the 'browse' tab:

[Screenshot: Data Browser view of the dataset]

Thanks very much in advance!

Dan