Hello guys,
I am trying to fuzzy-match two string variables (var1 and var2) in my dataset using Levenshtein distance (via the strdist package), which seems to fit my needs.
The only problem is that I need to calculate the Levenshtein distance between each observation of var1 and each observation of var2, and I am not sure how. As of now, running strdist var1 var2 gives me only the row-wise distance, that is, between the var1 and var2 values in the same row. Does anyone know how best to implement the all-pairs comparison?
Best,
Fredrick
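
One possible approach, offered as a sketch rather than a definitive answer: build every var1/var2 combination with Stata's built-in cross command, then run strdist on the resulting rows. This assumes the data in memory hold the two string variables var1 and var2, that strdist is installed from SSC, and that its generate() option is available in your version (check help strdist). The variable name lev and the tempfile name second are made up for illustration.

```stata
* Assumption: strdist comes from SSC; install it once if needed.
* ssc install strdist

* Save the non-empty var2 values in a temporary file.
preserve
keep var2
drop if missing(var2)
duplicates drop
tempfile second
save `second'
restore

* Keep the non-empty var1 values in memory.
keep var1
drop if missing(var1)
duplicates drop

* cross forms every pairwise combination of the rows in memory
* with the rows in the file, so each var1 meets each var2.
cross using `second'

* Each row is now one (var1, var2) pair, so the row-wise
* calculation covers all pairs. lev is a hypothetical name.
strdist var1 var2, generate(lev)
```

Note that the crossed dataset has (number of var1 values) x (number of var2 values) rows, so it can get large quickly. If you only want the closest var2 for each var1, something like bysort var1 (lev): keep if _n == 1 afterwards should keep one best match per var1, with ties broken arbitrarily.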