HI,
We are running an analysis that estimates promotion hazards for 13,534 workers of different races (White, Black, Asian, Hispanics) in a US federal agency. The regressions control for nearly perfectly observed productivity measures that are supposed to matter for promotions. Our results suggest statistically different promotion hazards in the following order: Blacks < Hispanics < Asians < Whites. Blacks have up to a 70% lower hazard of promotion at the highest grade.
We want to check whether this disparity is a result of stereotypes and statistical discrimination. Thus, we estimate the same hazard models, but replacing the race categorical variables with variables that may lead to stereotypes, in particular (a) Median income of each of the four races, (b) Average educational levels of each of the four races, and (c) GDP of race-origin countries. Most importantly, we do NOT have within group/race variation on any of these three variables, so basically, we have only four values per variable (one value per race). We obtain results suggesting a strong positive relationship between these variables and promotion hazards.
Question: Is it OK to do what we did? That is, replace a categorical variable that can take on four possible values (Blacks, Hispanics, Asians, Whites) with one variable, say median income for each of the four races?
Thank you so much.
Deepak
Related Posts with Variance in dependent variable
Separating epiweeks with corresponding yearsGood day all I was wondering if you could please help. I have a data set with years 2020 and 2021. …
could not estimate full model no observations r(2000)I'm a new of STATA , I have some data for analysis Multiple logistic regression and I got error r(20…
How would I create a 95% CI on a scatter plot?This might be very simple, but how would I show a 95% confidence interval on a scatter plot which pa…
â as missingThe imported data replaces the missing value symbol from the original data with an "â". My goal is t…
How toextract the results of various stata commands into matrices or mata universally and selectivelyFor stata commands(such as npresent, fsum, etc.) without a matrix in the return values(the result of…
Subscribe to:
Post Comments (Atom)
0 Response to Variance in dependent variable
Post a Comment