reghdfe takes hours - how to run panel regressions with FE faster?

Good morning,

I have a dataset with 4 million observations on 2 million students, each observed at two points in time (grade 5 and grade 8), from 5 different cohorts (observed between 2012 and 2019) in 15,000 schools. I am running the following panel regression with fixed effects:

Code:

qui reghdfe test_score proportion_females, absorb(school year student grade i.school#c.year) vce(cluster i.school)

which takes more than 2 hours. With another model specification (with control variables instead of the student FE), one whole night was not enough to run a single regression.

Is there a way to run the regression faster? I need to run 20+ of these regressions because of different model specifications. Are there any tricks that reduce the computation time (e.g. reducing the number of decimals of the dependent and independent variables)?

I am using Stata/SE 15.1 for Mac (64-bit Intel) on a 2014 Macbook Pro (2.4 GHz Quad-Core Intel Core i7) with macOS Catalina 10.15.2.

Thank you in advance for your help and tips.
