Hi,

I would like to merge two datasets. Both datasets have the following columns: Year Gvkey Cusip CompanyName. These variables all have some missing values.

However, to maximize the number of observations after merging two datasets, I would like to use the following methods:

1. merge using year gvkey first, but if missing gvkey, then merge using year cusip.
2. merge using year gvkey first, but if missing gvkey, then merge using year company name (fuzzy match).

Could anyone share some thoughts on how to achieve these? Thanks!!