I am doing research on racial discrimination at the loan approval decisions, i.e. whether minority borrowers have a lower approval probability than similar white borrowers, c.p.
Actually, previous research has already raised evidence of discrimination against minorities using a logit regression in the following form,
P(approval)= f (minority status, loan features, borrower characteristics, ..., and some other controls)
A negative and statistically significant coefficient from this logit model reveals that minority status reduces the loan approval probability, c.p.
Nevertheless, this model is unable to distinguish between differential treatment and disparate impact discrimination. In the form of differential treatment discrimination, two otherwise equal borrowers - except their race and ethnicity - will be treated differently by lenders. The second form - disparate impact discrimination - has a legal cover but can have an unintentional disparate impact against minority borrowers. One example is that lenders could set a minimum income level for all borrowers. This seemingly race-blind requirement will most likely negatively impact minority borrowers but not white borrowers because on average minorities have a lower income level than white.
The best way and the only way to isolate differential treatment discrimination in loan approvals is the paired testing methodology. Specifically, two applicants with the same credit histories and in need of the same type of loan would apply for a mortgage at the same lender. In this setting, the observed differences in treatment only reflect the differential treatment discrimination because two applicants are identically qualified. But the paired testing methodology is hardly practical in real life, because of the fact that pushing pair testing into the loan approval stage might be illegal and face high legal bills.
I noticed that the propensity score matching is used to balance the distribution of covariates, in other words, it will match the observations and make them the most similar in the covariables except the treatment indicator - in our case, the minority indicator. In other words, the propensity score matching seems perfectly imitate the paired testing. The minority-status impact is just the difference between the observed value of one observation and the observed value of its matching. Race as a treatment seems to be unreasonable. But maybe we can assume that a borrower enrolled in a "minority program" when he/she was born. The borrower enrolled in this minority program might have a lower income or other disadvantages in the future.
In fact, when I run the baseline logit model,
Code:
logit approval minority income_w dti20 dti20_30 dti30_36 dti36_49 dti50_60 fico680_699 fico700_719 fico720_739 ltv80 ltv80_85 ltv85_90 ltv90_95 origination_2019 refinance female age62 lender_top100 shadowbank fintech aus tract_minority_population_percen tract_owner_occupied_units tract_one_to_four_family_homes tract_median_age_of_housing_unit cra fhfa_index
Code:
Logistic regression Number of obs = 250,000
LR chi2(28) = 55744.90
Prob < chi2 = 0.0000
Log likelihood = -88966.138 Pseudo R2 = 0.2386
--------------------------------------------------------------------------------------------------
approval | Coefficient Std. err. z P<|z| [95% conf. interval]
---------------------------------+----------------------------------------------------------------
minority | -.391179 .0152328 -25.68 0.000 -.4210346 -.3613233
income_w | .004729 .0001977 23.93 0.000 .0043416 .0051163
dti20 | 2.967021 .0540445 54.90 0.000 2.861096 3.072947
dti20_30 | 3.664266 .0424642 86.29 0.000 3.581038 3.747495
dti30_36 | 3.892662 .0413462 94.15 0.000 3.811625 3.973699
dti36_49 | 3.960401 .0378111 104.74 0.000 3.886293 4.034509
dti50_60 | 3.709353 .038279 96.90 0.000 3.634328 3.784378
fico680_699 | .0205687 .0424997 0.48 0.628 -.0627291 .1038665
fico700_719 | .11979 .0419051 2.86 0.004 .0376574 .2019225
fico720_739 | .0570352 .0466314 1.22 0.221 -.0343607 .1484311
ltv80 | -.2817957 .0251547 -11.20 0.000 -.331098 -.2324933
ltv80_85 | -.043908 .0258024 -1.70 0.089 -.0944797 .0066637
ltv85_90 | -.2121639 .0289899 -7.32 0.000 -.2689831 -.1553448
ltv90_95 | -.3095127 .0256459 -12.07 0.000 -.3597778 -.2592476
origination_2019 | .2395657 .0132166 18.13 0.000 .2136617 .2654698
refinance | -1.23423 .0219976 -56.11 0.000 -1.277345 -1.191116
female | -.02576 .0135202 -1.91 0.057 -.0522592 .0007392
age62 | -.3451483 .0198167 -17.42 0.000 -.3839883 -.3063083
lender_top100 | -.4454505 .0154397 -28.85 0.000 -.4757118 -.4151892
shadowbank | -.0205853 .0167077 -1.23 0.218 -.0533318 .0121612
fintech | -.1228223 .0212574 -5.78 0.000 -.1644859 -.0811586
aus | 2.048448 .0218263 93.85 0.000 2.005669 2.091226
tract_minority_population_percen | .003672 .000289 12.71 0.000 .0031055 .0042384
tract_owner_occupied_units | .0001755 .0000218 8.07 0.000 .0001329 .0002182
tract_one_to_four_family_homes | -.0000885 .0000165 -5.35 0.000 -.0001209 -.0000561
tract_median_age_of_housing_unit | -.0005254 .0004295 -1.22 0.221 -.0013672 .0003164
cra | -.1248721 .0165081 -7.56 0.000 -.1572274 -.0925168
fhfa_index | .0417232 .0042321 9.86 0.000 .0334285 .0500179
_cons | -3.884013 .0679459 -57.16 0.000 -4.017184 -3.750841
--------------------------------------------------------------------------------------------------Code:
teffects psmatch (approval) (minority income_w dti20 dti20_30 dti30_36 dti36_49 dti50_60 fico680_699 fico700_719 fico720_739 ltv80 ltv80_85 ltv85_90 ltv90_95 origination_2019 refinance female age62 lender_top100 shadowbank fintech aus tract_minority_population_percen tract_owner_occupied_units tract_one_to_four_family_homes tract_median_age_of_housing_unit cra fhfa_index)
Code:
Treatment-effects estimation Number of obs = 250,000
Estimator : propensity-score matching Matches: requested = 1
Outcome model : matching min = 1
Treatment model: logit max = 3
------------------------------------------------------------------------------
| AI robust
approval | Coefficient std. err. z P<|z| [95% conf. interval]
-------------+----------------------------------------------------------------
ATE |
minority |
(1 vs 0) | -.043562 .0026974 -16.15 0.000 -.0488489 -.0382751
------------------------------------------------------------------------------Can we use the propensity score matching to imitate the paired testing and isolate the differential treatment discrimination?
Is this method feasible?
Thanks!
0 Response to Propensity Score Matching to estimate racial discrimination
Post a Comment