Hello guys,
I am currently trying to do fuzzy matching of two "string" variables (var1 and var2) in my dataset using Levenshtein Distance (-strdist package), which seems to fit my needs.
The only problem that I am having is that I need to calculate the levenshtein distance of each observation in variable 1 with each observation of variable 2, and I am not sure how. As of now, when running strdist var1 var2, i get a pairwise calculation of levenshtein distance between observations in var1 and var2 from the same row. I was wondering if anyone might know how to best implement it?
Best,
Fredrick
Related Posts with Levenshtein Distance (fuzzy matching) with a loop
Time Trends for Panel DataHello, I am writing a descriptive paper on substance use estimates across 4 years (2013-2017) using…
regression effect sizes after using mi estimateHi STATA listers, I am using mi estimate to complete a multiple regression analysis. I would like t…
New version of invdesc on SSCThanks as always to Kit Baum, a new version of the invdesc package is now available for download fro…
New version of listtab on SSCThanks once again to Kit Baum, a new version of the listtab package is now available for download fr…
Condition application logistic regressionsHi everyone, I was wondering what the conditions of application of a logistic regression are and how…
Subscribe to:
Post Comments (Atom)
0 Response to Levenshtein Distance (fuzzy matching) with a loop
Post a Comment