Does anyone know how reclink chooses which potential matches to report? Does it effectively sort the potential matches by similarity score and start with the match with the highest score (greedy algorithm)? Or does it use some sort of optimal algorithm? Or something else?
Similarly, for people who use matchit, how do you choose which potential matches to use when doing a 1:1 fuzzy match of two datasets?
I'm looking more for best practices than code, though I'd be interested in code that maximized the total similarity score if anyone had such a thing.
Thank you,
Kramer
Related Posts with Fuzzy matching: choosing potential matches (reclink/matchit)
xtbcfe erroeDear All, I am trying to use the user written routine xtbcfe for bias corrected fixed effect estima…
Creating portfolios with double sorting and calculating mean returns of the portfolios.(please help)This is the third time I write this topic but did not get any answer from statalists members, and ca…
cmissing having no effectI am trying to create a line graph that does not connect when there are observations with missing da…
Issue with svyset, getting "command svyset_8 is unrecognized"Hello all, I am in the process of analyzing some data from the National Readmissions Database (NRD)…
How to print values in parenthesis when using a matrix with esttab?So I generate a lot of tables where I have to use parenthesis (e.g mean (SD) format). The problem is…
Subscribe to:
Post Comments (Atom)
0 Response to Fuzzy matching: choosing potential matches (reclink/matchit)
Post a Comment