Hello all,
I am performance ols regression, something like: regress y i.x1##i.x2 x3 x4 x5, cluster(i)
Sample is 20k observations.
Y is a logged count variable and is not a rare event
x1 is a dichotomous variable that is not super common, but I don't think is problematic (takes on the value of 1 for 600 out of 20000 obs)
However, x2 is a rare event (the variable is mostly 0s with a few 1s).
I am interacting x1 and x2 because the interaction is of theoretical interest for my study. However, there are only about 20 observations for which both x1 and x2 = 1 simultaneously. Is this a concern for proceeding with the study I want to conduct? There is a valid structural explanation for why it is only 20 obs where x1 and x2 = 1, but I want to be sure that I am embarking on this course of study with sound footing.
Thanks for your input!
Related Posts with OLS regression where one independent variable is a rare event
Nijman and Verbeek (1992) test to formally test for attrition biasDear all, I desperately need help on the step by step process of using Nijman and Verbeek (1992) te…
Wage distribution percentile differencesHi, everyone! Right now I have to do research about how the minimum wage affects wage distribution.…
File read and tokenize localsHi everyone! I have been trying to tokenize locals that are produced by file read. However, weird t…
Multiple hypothesis testing command updatedFor those interested in multiple hypothesis testing, a new version of wyoung is now available on SSC…
KINKYREG: new Stata command for instrument-free inference in linear regression models with endogenous regressorsI just released a brand-new Stata package called kinkyreg, which I developed jointly with Jan Kiviet…
Subscribe to:
Post Comments (Atom)
0 Response to OLS regression where one independent variable is a rare event
Post a Comment