HI,
We are running an analysis that estimates promotion hazards for 13,534 workers of different races (White, Black, Asian, Hispanics) in a US federal agency. The regressions control for nearly perfectly observed productivity measures that are supposed to matter for promotions. Our results suggest statistically different promotion hazards in the following order: Blacks < Hispanics < Asians < Whites. Blacks have up to a 70% lower hazard of promotion at the highest grade.
We want to check whether this disparity is a result of stereotypes and statistical discrimination. Thus, we estimate the same hazard models, but replacing the race categorical variables with variables that may lead to stereotypes, in particular (a) Median income of each of the four races, (b) Average educational levels of each of the four races, and (c) GDP of race-origin countries. Most importantly, we do NOT have within group/race variation on any of these three variables, so basically, we have only four values per variable (one value per race). We obtain results suggesting a strong positive relationship between these variables and promotion hazards.
Question: Is it OK to do what we did? That is, replace a categorical variable that can take on four possible values (Blacks, Hispanics, Asians, Whites) with one variable, say median income for each of the four races?
Thank you so much.
Deepak
Related Posts with Variance in dependent variable
ATT (Average treatment effects on the treated)Hi, I am attempting to estimate the impact of hosting the World Cup on the growth rate of countries…
Predicting and saving residuals after running regressions on several sample unitsDear Statalist, I am running regressions on farm economic data which I have set as panel data - eac…
Odd results when combining near collinearity with simultaneityI am exploring what happens when two regressors are collinear, and at least one is simultaneous with…
Odds ratio for continous variables in logistic regressionHi everybody I have a question about the interpretation of the following logistic regression: The …
Predicting dependent variableHi everyone, I have a linear model Y=bX + e. I know that, in order to obtain the predicted value of…
Subscribe to:
Post Comments (Atom)
0 Response to Variance in dependent variable
Post a Comment