Hi, I am trying fuzzy string matching from two files using 'dtalink' package. In both files I have alphanumeric firmname 1800flowerscom, 7eleven and 3m.
'dtalink' only matches 1800flowerscom and 7eleven from both file but not the 3m. I was wondering why. To add a blank at both ends of the string, I use the code: gen newfirmname = " "+firmname+" "
Fuzzy matching code: dtalink firmname 10 0 using file1, block(firmname_f | firmname_l ) cutoff(10) calcweights
Thanks.
Related Posts with Fuzzy matching alphanumeric name
Generate a MatrixHi all, I am having a problem generating a matrix from data. Suppose I have a dataset like this …
Data reshapingDear Statalist: I got the following data set: Date Status AB CD EF 14-Mar-20 Low 81 3 7 14-M…
dates in Excel into monthly dates in STATAHello everybody, I have a spreadsheet in Excel 16, with monthly mutual funds' returns. The column o…
Income inequality panel dataI would like to perform a difference-in-differences regression to determine the effect of the nation…
Panel data set restrictions for specific ID and time periodHello, I hope you are all safe and healthy. I have Stata version 14 and a panel data set consists o…
Subscribe to:
Post Comments (Atom)
0 Response to Fuzzy matching alphanumeric name
Post a Comment