Hi guys,

hope you are enjoying festivities.

So basically I am trying to perform a fuzzy matching between the two following databases (where variable "prd" in Master DataBase is the equivalent of variable "drug_name" in using DataBase):

Master DataBase:

Code:
* Example generated by -dataex-. To install: ssc install dataex
clear
input str4 atc3no0 str18 prd float Year
"A7F"  "ACIDOPHILUS"        2013
"A7F"  "ACIDOPHILUS"        2014
"A7F"  "ACIDOPHILUS"        2015
"M5X"  "ARTHRI-FLEX"        2013
"M5X"  "ARTHRI-FLEX"        2014
"M5X"  "ARTHRI-FLEX"        2015
"A12A" "CALCIUM 600"        2014
"A12A" "CALCIUM 600"        2015
"A12A" "CALCIUM 600+D3"     2014
"A12A" "CALCIUM 600+D3"     2015
"A12A" "CALCIUM 600+D3 PLU" 2014
"A12A" "CALCIUM 600+D3 PLU" 2015
"A12A" "CALCIUM CIT/VIT D"  2013
"A12A" "CALCIUM CIT/VIT D"  2014
"A12A" "CALCIUM CIT/VIT D"  2015
"A12A" "CALCIUM/VIT D"      2008
"A12A" "CALCIUM/VIT D"      2009
"A12A" "CALCIUM/VIT D"      2010
"A12A" "CALCIUM/VIT D"      2011
"A12A" "CALCIUM/VIT D"      2012
"A12A" "CALCIUM/VIT D"      2013
"A12A" "CALCIUM/VIT D"      2014
"A12A" "CALCIUM/VIT D"      2015
"V3X"  "CINNAMON"           2008
"V3X"  "CINNAMON"           2009
"V3X"  "CINNAMON"           2010
"V3X"  "CINNAMON"           2011
"V3X"  "CINNAMON"           2012
"V3X"  "CINNAMON"           2013
"V3X"  "CINNAMON"           2014
"V3X"  "CINNAMON"           2015
"C1B"  "FISH OIL"           2012
"C1B"  "FISH OIL"           2013
"C1B"  "FISH OIL"           2014
"C1B"  "FISH OIL"           2015
"V3X"  "FLAXSEED OIL"       2013
"V3X"  "FLAXSEED OIL"       2014
"V3X"  "FLAXSEED OIL"       2015
"B3X"  "FOLIC ACID"         2014
"B3X"  "FOLIC ACID"         2015
"M5X"  "GLUCOSAMINE/CHONDR" 2008
"M5X"  "GLUCOSAMINE/CHONDR" 2009
"M5X"  "GLUCOSAMINE/CHONDR" 2010
"M5X"  "GLUCOSAMINE/CHONDR" 2011
"M5X"  "GLUCOSAMINE/CHONDR" 2012
"M5X"  "GLUCOSAMINE/CHONDR" 2013
"M5X"  "GLUCOSAMINE/CHONDR" 2014
"B3A1" "IRON"               2014
"B3A1" "IRON"               2015
"C1B"  "KRILL OIL OMEGA"    2012
"C1B"  "KRILL OIL OMEGA"    2013
"C1B"  "KRILL OIL OMEGA"    2014
"C1B"  "KRILL OIL OMEGA"    2015
"S1M"  "LUTEIN"             2014
"S1M"  "LUTEIN"             2015
"H4X"  "MELATONIN"          2008
"H4X"  "MELATONIN"          2009
"H4X"  "MELATONIN"          2010
"H4X"  "MELATONIN"          2011
"H4X"  "MELATONIN"          2012
"H4X"  "MELATONIN"          2013
"H4X"  "MELATONIN"          2014
"H4X"  "MELATONIN"          2015
"A12C" "MGO"                2013
"A12C" "MGO"                2014
"A12C" "MGO"                2015
"A11X" "NIACIN FLUSH FREE"  2011
"A11X" "NIACIN FLUSH FREE"  2012
"A11X" "NIACIN FLUSH FREE"  2013
"A11X" "NIACIN FLUSH FREE"  2014
"A11X" "NIACIN FLUSH FREE"  2015
"A11A" "PRENATAL"           2010
"A11A" "PRENATAL"           2011
"A11A" "PRENATAL"           2012
"A11A" "PRENATAL"           2013
"A11A" "PRENATAL"           2014
"A11A" "PRENATAL"           2015
"V3X"  "PRENATAL DHA"       2012
"V3X"  "PRENATAL DHA"       2013
"V3X"  "PRENATAL DHA"       2014
"V3X"  "PRENATAL DHA"       2015
"A11A" "PRENATAL VIT/DHA"   2013
"A11A" "PRENATAL VIT/DHA"   2014
"A11A" "PRENATAL VIT/DHA"   2015
"G2X9" "SOY ISOFLAVONES EX" 2011
"C1B"  "TRIPLE OMEGA CMPLX" 2014
"C1B"  "TRIPLE OMEGA CMPLX" 2015
"A11F" "VIT B12"            2013
"A11F" "VIT B12"            2014
"A11F" "VIT B12"            2015
"D8A"  "AVAGARD"            2004
"D8A"  "AVAGARD"            2005
"D8A"  "AVAGARD"            2006
"D8A"  "AVAGARD"            2007
"D8A"  "AVAGARD"            2008
"D8A"  "AVAGARD"            2009
"D8A"  "AVAGARD"            2010
"D8A"  "AVAGARD"            2011
"D8A"  "AVAGARD"            2012
"D8A"  "AVAGARD"            2013
end

Using DataBase:

Code:
* Example generated by -dataex-. To install: ssc install dataex
clear
input str4 atc3no0 str100 drug_name int priority_year byte(priorityUS finalphase)
"A1C" "PEGylated oral insulin, Biocon"                                   2001 1 5
"A1C" "insulin, human"                                                   1982 0 9
"A1C" "insulin lispro"                                                   1994 1 9
"A1C" "AFREZZA"                                                          1994 1 9
"A1C" "drug delivery system, oral insulin, Generex Biotechnology"        1998 1 .
"A1C" "ORMD 0801"                                                        2005 1 5
"A1C" "ORMD 0801"                                                        2005 1 5
"A1C" "insulin aspart"                                                   1985 0 9
"A1C" "insulin glulisine"                                                2001 0 9
"A1C" "insulin lispro"                                                   1989 1 9
"A1C" "drug delivery system, oral insulin, Generex Biotechnology"        1998 1 .
"A1C" "insulin, human"                                                   1990 0 9
"A1C" "insulin degludec"                                                 2003 1 9
"A1C" "drug delivery system, oral insulin, Generex Biotechnology"        1998 1 .
"A1C" "NASULIN"                                                          2003 1 5
"A1C" "insulin lispro"                                                   1994 1 9
"A1C" "drug delivery system, TRANSFERSOMES transdermal insulin"          1990 0 5
"A1C" "insulin lispro"                                                   1989 1 9
"A1C" "insulin glulisine"                                                2001 0 9
"A1C" "ALBULIN"                                                          2000 1 2
"A1C" "ORMD 0801"                                                        2005 1 5
"A1C" "insulin lispro"                                                   1994 1 9
"A1C" "drug delivery system, oral insulin, Generex Biotechnology"        1998 1 .
"A1C" "insulin aspart"                                                   1985 0 9
"A1C" "AI 401"                                                           1990 1 5
"A1C" "drug delivery system, MEDUSA, human insulin, FLAMEL Technologies" 1995 0 5
"A1C" "NASULIN"                                                          2001 1 5
"A1C" "insulin degludec"                                                 2003 1 9
"A1C" "insulin glargine"                                                 1988 0 9
"A1C" "insulin aspart"                                                   1996 0 9
"A1C" "insulin lispro"                                                   1989 1 9
"A1C" "insulin lispro"                                                   1994 1 9
"A1C" "ALBULIN"                                                          2000 1 2
"A1C" "drug delivery system, inhaled insulin, Pfizer"                    1994 1 9
"A1C" "insulin lispro"                                                   1989 1 9
"A1C" "insulin lispro"                                                   1994 1 9
"A1C" "ORMD 0801"                                                        2005 1 5
"A1C" "insulin degludec + liraglutide"                                   2007 0 9
"A1C" "insulin peglispro"                                                2008 1 6
"A1C" "insulin glulisine"                                                1997 0 9
"A1C" "insulin lispro"                                                   1989 1 9
"A1C" "AFREZZA"                                                          1994 1 9
"A1C" "NASULIN"                                                          2003 1 5
"A1C" "insulin aspart"                                                   1996 0 9
"A1C" "NASULIN"                                                          2001 1 5
"A1C" "insulin aspart + insulin degludec"                                2003 1 9
"A1C" "drug delivery system, inhaled insulin, Pfizer"                    1994 1 9
"A1C" "ORMD 0801"                                                        1995 0 5
"A1C" "insulin lispro"                                                   1989 1 9
"A1C" "AI 401"                                                           1990 1 5
"A1C" "insulin detemir"                                                  1993 0 9
"A1C" "ORMD 0801"                                                        1995 0 5
"A1C" "NASULIN"                                                          2003 1 5
"A1C" "insulin glulisine"                                                1997 0 9
"A1C" "NASULIN"                                                          2003 1 5
"A1C" "NASULIN"                                                          2001 1 5
"A1C" "insulin lispro"                                                   1994 1 9
"A1C" "PEGylated oral insulin, Biocon"                                   2001 1 5
"A1C" "insulin peglispro"                                                2008 1 6
"A1C" "insulin aspart + insulin degludec"                                2003 1 9
"A1C" "NASULIN"                                                          2001 1 5
"A1H" "drug delivery system, GITS glipizide"                             1989 1 9
"A1H" "drug delivery system, GITS glipizide"                             1989 1 9
"A1H" "glimepiride"                                                      1979 0 9
"A1H" "drug delivery system, modified release gliclazide, Servier"       1999 0 9
"A1J" "metformin + dapagliflozin"                                        1999 1 8
"A1J" "metformin"                                                        1998 1 9
"A1J" "GLUMETZA"                                                         1997 1 9
"A1J" "glibenclamide + metformin"                                        1998 0 9
"A1J" "metformin + dapagliflozin"                                        1999 1 8
"A1J" "GLUMETZA"                                                         1997 1 9
"A1J" "canagliflozin + metformin"                                        2003 1 8
"A1J" "canagliflozin + metformin"                                        2003 1 8
"A1J" "fenofibrate + metformin"                                          2002 0 6
"A1J" "metformin"                                                        1998 1 9
"A1J" "drug delivery system, extended-release metformin, Actavis"        1998 1 9
"A1J" "drug delivery system, extended-release metformin, Actavis"        1998 1 9
"A1K" "pioglitazone + glimepiride"                                       2003 0 9
"A1K" "balaglitazone"                                                    1996 1 6
"A1K" "pioglitazone"                                                     1985 0 9
"A1K" "rosiglitazone"                                                    1987 0 9
"A1K" "pioglitazone"                                                     1985 0 9
"A1K" "metformin + rosiglitazone"                                        1997 0 9
"A1K" "pioglitazone + metformin extended-release"                        2002 1 9
"A1K" "rosiglitazone"                                                    1992 0 9
"A1K" "englitazone"                                                      1985 1 2
"A1K" "lobeglitazone"                                                    2002 0 .
"A1K" "englitazone"                                                      1987 1 2
"A1K" "englitazone"                                                      1987 1 2
"A1K" "pioglitazone"                                                     1978 0 9
"A1K" "englitazone"                                                      1985 1 2
"A1K" "rosiglitazone"                                                    1991 0 9
"A1K" "NC 2100"                                                          1994 0 2
"A1K" "rosiglitazone"                                                    1987 0 9
"A1K" "rosiglitazone"                                                    1991 0 9
"A1K" "pioglitazone + metformin"                                         1995 0 9
"A1K" "pioglitazone + metformin extended-release"                        2002 1 9
"A1K" "rosiglitazone"                                                    1992 0 9
"A1K" "rosiglitazone"                                                    1991 0 9
"A1K" "rosiglitazone"                                                    1992 0 9
end
My aim is to perform a fuzzy matching between the two databases using as variables atc3no0 and the drug name with a lower weight on the name of the product. I have read about reclink but actually am not able to understand if it can fits to my needs.

Can you please help me?


Edit: I have done something along these lines but an error ") required" occurs:
Code:
reclink atc3no0 prd using output_estrazionedatiDS13_US, gen(myscore) idm(id_mas) idu(id_us) wmatch(10 2)
Federico