Hello Stata users,
I started using Stata recently and have run into a problem. My dataset has millions of observations and about 30 variables. Before running regressions, I first tried to balance my dataset using the worker identifier variable (worker_id) and the time variable (year). The same worker can appear in several years between 2010 and 2018, but not necessarily in every year of that period; some workers appear in only a few years.
When I run the command "xtset worker_id year", I get the error "repeated time values within panel - r(451);". How can I solve this problem?
Thank you for your attention.
Best regards!
Kate
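
A possible way to diagnose this, sketched under the assumption that the error comes from some worker_id-year pairs appearing in more than one row (for example, a worker holding two jobs in the same year, or rows that were loaded twice). Note that xtset does not require a balanced panel: gaps in years are allowed, but each worker_id-year pair must be unique. The variable name dup below is hypothetical and only used for illustration.

```stata
* Report how many worker_id-year combinations occur more than once
duplicates report worker_id year

* Flag the repeated combinations (dup = number of extra copies) and count them
duplicates tag worker_id year, generate(dup)
count if dup > 0

* Look at the flagged rows to see why they repeat
browse if dup > 0

* If the repeats are exact copies of entire rows, drop the surplus copies
duplicates drop

* Try again; this still fails if a worker has several distinct records
* in one year, in which case you must decide which record to keep
* (or aggregate to one row per worker and year) before declaring the panel
xtset worker_id year
```

If the repeated rows are genuinely different records, the right fix depends on the research question, so it is worth inspecting them before dropping anything.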