I was looking at a paper (link here) by Blackwell et al. (2010) on CEM in Stata.
In one example using an example data set, the authors ran CEM using the matching covariates such as age, education, black, nodegree, and re74.
For the imbalance measure for each covariate (univariate imbalance), all the measures became below 0.1 (where 0 means fully balanced and 1 means not at all balanced).
But why Multivariate L1 distance, which takes account for all the imbalance measures at a time, is relatively so high (nearly 0.51)?
I read the definition of how Multivariate L1 distance is caculated.
Is the above reason due to the fact that Multivariate L1 distance is calculated based on absolute difference of frequencies over all the matching covairates between treatment and control group?
In other words, does Multivariate L1 distance can be high because the range of the values for each matching covariate can differ from each other (e.g. age ranges 17-55 whereas black and nodegree have binary values) so that calculating absolute difference for all naturally generates higher Multivariate L1 distance?
I am really interested in to understand this and I could not find a good explanation for this on the internet.
If I am wrong, I am happy to hear someone's comment on this.
Related Posts with Why Multivariate L1 distance from Coarsened Exact Matching (CEM) is so high compared to Univariate imbalance for each matching covariate?
recording forward observations in panel dataDear Statalist, This is probably a trivial question, but I couldn't find a solution for it myself. …
incorrect panel data structureDear All, I am facing the following (incorrect) long panel data format. Code: * Example generated…
Jensen-Shannon Divergence between training and test set classesI'm working on Stata and I have a training and a test set. For each one I have a set of variables (h…
Shaping dataHello, I would like to transform my data from this way : order station up down 1 A 48 0 …
Interpreting an interaction term in a panel linear regression modelHi, I am exploring the link between the introduction of a stewardship policy and the consumption of…
Subscribe to:
Post Comments (Atom)
0 Response to Why Multivariate L1 distance from Coarsened Exact Matching (CEM) is so high compared to Univariate imbalance for each matching covariate?
Post a Comment