Hello guys,
I am currently trying to do fuzzy matching of two "string" variables (var1 and var2) in my dataset using Levenshtein Distance (-strdist package), which seems to fit my needs.
The only problem that I am having is that I need to calculate the levenshtein distance of each observation in variable 1 with each observation of variable 2, and I am not sure how. As of now, when running strdist var1 var2, i get a pairwise calculation of levenshtein distance between observations in var1 and var2 from the same row. I was wondering if anyone might know how to best implement it?
Best,
Fredrick
Related Posts with Levenshtein Distance (fuzzy matching) with a loop
How to calculate index value for eq5d pleaseQuestion …
Individual fixed effects vs. school fixed effectsHi all, I am deciding between a school and year fixed effects model and an individual fixed effects…
Trouble with interpolating data: code is interpolating non-missing dataHi everyone, I am trying to interpolate panel data using this code: Code: ipolate v2pesecsch year,…
Extracting macro in a loopDear Statalist Users, A much simplified version of my current dataset is as follows: Code: * Exa…
How to remove all characters that are not part of a predefined list?How can I remove from a STRING VARIABLE all characters that are not part of a predefined list? For e…
Subscribe to:
Post Comments (Atom)
0 Response to Levenshtein Distance (fuzzy matching) with a loop
Post a Comment