Hello all,
I am performing an OLS regression, something like: regress y i.x1##i.x2 x3 x4 x5, cluster(i)
The sample is 20,000 observations.
y is a logged count variable and is not a rare event.
x1 is a dichotomous variable that is not very common, but I don't think it is problematic (it takes on the value of 1 for 600 out of 20,000 obs).
However, x2 is a rare event (the variable is mostly 0s with a few 1s).
I am interacting x1 and x2 because the interaction is of theoretical interest for my study. However, there are only about 20 observations for which both x1 and x2 = 1 simultaneously. Is this a concern for proceeding with the study I want to conduct? There is a valid structural explanation for why there are only 20 obs where x1 and x2 = 1, but I want to be sure that I am embarking on this course of study on sound footing.
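For reference, here is a minimal sketch of how the cell counts behind the interaction can be inspected before fitting the model. It assumes the placeholder variable names from the command above (y, x1, x2, x3, x4, x5, and the cluster identifier i) and is an illustration, not my exact code:

```stata
* Cross-tabulate the two indicators to see how many observations
* fall in each of the four x1-by-x2 cells (names are placeholders)
tabulate x1 x2

* Count the observations that identify the interaction term directly
count if x1 == 1 & x2 == 1

* Fit the model with the full factorial interaction and
* cluster-robust standard errors
regress y i.x1##i.x2 x3 x4 x5, vce(cluster i)
```

The cross-tabulation shows at a glance how thin the x1 = 1, x2 = 1 cell is relative to the other three cells.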
Thanks for your input!