Does anyone know how reclink chooses which potential matches to report? Does it effectively sort the potential matches by similarity score and start with the match with the highest score (greedy algorithm)? Or does it use some sort of optimal algorithm? Or something else?
Similarly, for people who use matchit, how do you choose which potential matches to use when doing a 1:1 fuzzy match of two datasets?
I'm looking more for best practices than code, though I'd be interested in code that maximizes the total similarity score if anyone has such a thing.
Thank you,
Kramer
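To make the greedy-versus-optimal distinction in the question concrete, here is a minimal Python sketch. It does not show what reclink or matchit do internally; it only illustrates the two strategies on a small matrix of similarity scores. The function names (`greedy_match`, `optimal_match`, `total`) and the example scores are hypothetical, and the optimal version is a brute-force search that is only practical for a handful of records.

```python
from itertools import permutations

def greedy_match(scores):
    """Take candidate pairs in descending score order, skipping any pair
    whose row or column has already been matched."""
    pairs = sorted(
        ((s, i, j) for i, row in enumerate(scores) for j, s in enumerate(row)),
        reverse=True,
    )
    used_rows, used_cols, match = set(), set(), {}
    for s, i, j in pairs:
        if i not in used_rows and j not in used_cols:
            match[i] = j
            used_rows.add(i)
            used_cols.add(j)
    return match

def optimal_match(scores):
    """Brute-force the 1:1 assignment with the highest total score.
    Assumes a square matrix; the cost grows factorially with its size."""
    n = len(scores)
    best = max(
        permutations(range(n)),
        key=lambda p: sum(scores[i][p[i]] for i in range(n)),
    )
    return dict(enumerate(best))

def total(scores, match):
    """Sum of the similarity scores of the chosen pairs."""
    return sum(scores[i][j] for i, j in match.items())

# Hypothetical scores: rows are records in dataset A, columns in dataset B.
scores = [
    [0.90, 0.85],
    [0.88, 0.10],
]

greedy = greedy_match(scores)    # takes the 0.90 pair first, leaving the 0.10 pair
optimal = optimal_match(scores)  # pairs 0.85 with 0.88 instead
print(greedy, total(scores, greedy))
print(optimal, total(scores, optimal))
```

In this example the greedy rule locks in the single best pair and is then forced to accept a very poor second pair, while the optimal assignment gives up a little on the first pair to get a much higher total. For datasets of realistic size, the same problem is usually handed to an assignment solver such as `scipy.optimize.linear_sum_assignment` with `maximize=True` rather than enumerated by brute force.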