Hi,
I would like to merge two datasets. Both datasets have the following columns: Year Gvkey Cusip CompanyName. These variables all have some missing values.
However, to maximize the number of observations after merging two datasets, I would like to use the following methods:
1. merge using year gvkey first, but if missing gvkey, then merge using year cusip.
2. merge using year gvkey first, but if missing gvkey, then merge using year company name (fuzzy match).
Could anyone share some thoughts on how to achieve these? Thanks!!
Related Posts with Dataset merging to maximize number of observations
What is wrong my code as posted here?There is two region dummy variables mi (mi takes the value 1 if Michigan and 0 otherwise) and 32 mon…
RE -xtgee- Equivalence with GLM?Hi! I was looking around for a panel version of GLM to run a log-gamma model on. From what I read i…
generating parents education variableHi everybody, I have a dataset that includes the variables: "family id", "highest diploma", "gender"…
Arch, Garch Tarch model and post estimation in PANEL DATAGood morning. I'm am running a model with Arch, T arch and Garch effects over stock returns and with…
Interaction effect with continuous criterion variable, dummy predictor and dummy moderator as linear plot?Hello everyone, I have a question regarding the adequacy of using a linear visualisation plot for a…
Subscribe to:
Post Comments (Atom)
0 Response to Dataset merging to maximize number of observations
Post a Comment