Hello,
I am doing a difference in difference in panel data set, years 2001-2015. The observation units are US counties. The variables in my dataset are all industries (ex: fishing, manufacturing, tourism, etc...) measured both in terms of GDP by county and in terms of employment by counties (therefore, each industry enters in my dataset twice). For certain combinations of industry j and county i, I have missing values for all years;
Ex: Baldwin county may have no missing values for all industries except for fishing, for which there are missing values for all years. Similarly, Sussex county may have values for all industries except for mining extraction only (for all years), and so on and so forth.
The missing values are not random. Indeed they are suppressed data for matters of privacy (the values were generally small, so it was possible to date back to the firms).
I would like to ask:
1) how Stata deals, by default, with missing values when I run a regression;
2) how can I replace this missing value in the best and more reliable way.
I previously tried with "ipolate", but I read that the estimates would not be reliable
I hope someone can help me thank you in advance
Related Posts with Replacing missing values in a panel data
FE RE ModelHello everyone. I wanted to know apart from testing which model to use , fixed effects or random eff…
Weights in panel dataHi, I am working with a set of panel data where I am looking at students' school results in differen…
Why does Stata 16 crash every time I close a do-file??Stata experts, Can anyone help me figure out why stata 16 crashes every time I close a do-file? It …
how can i combine two datasets like this?Hi, I want to combine two datasets. Here is an example of what i have: Dataset A: id Country Gen…
Looping through multiple levels of variables not workingHello, I am working on a project but I am struggling with my stata code. I have a data set that I cr…
Subscribe to:
Post Comments (Atom)
0 Response to Replacing missing values in a panel data
Post a Comment