Hi,
I am working with panel data where I have 9 different regions and in each region there are a different number of schools. The time period is 10 months. What I am trying to do is to collapse the variable sick-absence in percentage among the teachers in each school for each region and period, I therefor collapse by region and time. The weight that I want to use is the number of pupils at each school, making schools that have a larger number of students have more weights compared to the smaller once. My problem is that I don't understand if stata takes into account that I am collapsing by region and time period, is the weighting done by the number of pupils in each region and period or the pupils in all regions and all periods? The second problem that I have is that I want to have the number of pupils for each region and period but without any weight, is there any good solution for that?
My code so far:
collapse sick_absence [fweight=number_of_students], by(city timeperiod)
Related Posts with Collapse by several variables
What Estimation Method to use? (PSM, Probit, Logit, Heckman)Hi all, I'm currently writing my thesis in Finance on the explanatory power of ESG scores on the de…
Second minimum dateI have different date variables and want to find the second minimum date. That is when you order the…
max or min of rows---- bysort and generateHi statalist community, I need to generate two variables for my research. Code: * Example genera…
placebo test using -permute-Dear statalist, I'm trying to do a placebo test in a DID by randomly assigning "treat" to my sample…
Change xlabel in a graphHello everyone, I'm working with a time series data set and I'm making some graphs, using the tslin…
Subscribe to:
Post Comments (Atom)
0 Response to Collapse by several variables
Post a Comment