This is a new topic for me, so pardon my basic understanding of the heckman model. In fact, it's possible I should not be using a selection model at all, so I wanted to check first.
My dependent variable (individual-level) is species diversity captured by a birdwatcher in a district-time period. The independent variable is deforestation in the district-time period. Many individuals go looking for specific birds, so their data points are less useful for eliciting the general impact of deforestation on species diversity. The goal is to identify the individuals who capture everything in sight and focus on them for the analysis.
I plan to define a "veteran" birdwatcher as someone capturing a reasonably representative measure of diversity based on some predefined criteria. This could include the total trips they take, the number of months per year they go out, and whether they report all species during the trip. These predict veteran status but are not part of the outcome equation.
Initially, I dropped all observations that didn't meet the selection criteria. My understanding is that doing this biases my coefficients by truncating the distribution of error terms. Can I treat my issue as a selection model?
My idea is to generate a dummy=1 for veterans and 0 for non-veterans, based on the above criteria. Then I would estimate coefficients and standard errors with the heckman command. Is this the right approach? Is it a problem that I am "incidentally truncating" the data myself, and then correcting for it? Thanks.
Related Posts with Should I use a heckman selection model when extracting a subset of my data for analysis?
Panel data, want to see the regression results for two time periods and country setsHi, I have a panel dataset of with a variable country, year, university, sat score. (Similar variabl…
Asgen with missing observations for the weightDear Forum, I have a short question regarding the ASGEN command with weights. I am creating value w…
time series prediction assignmentBackground: As a researcher at the financial economics research department of the Federal Reserve Ba…
Dummy VariableHow do I create a dummy variable which takes the value 1 in a household when first two children are …
Increase in the number of observations after using RESHAPE commandDear community, My original data are in long shape. When I reshape long to wide there is an increas…
Subscribe to:
Post Comments (Atom)
0 Response to Should I use a heckman selection model when extracting a subset of my data for analysis?
Post a Comment