Hello,
So let me first describe what I am working with and the steps I took.
One dataset has for every company every year stated like, 1992, 1993, 1994, 1995.. and so on. with financial information about companies.
the next dataset is about CEO financials, and there it's 1992, 1992, 1992, 1993, 1993, 1993, 1993.. and so on. Because some companies have more or less CEO's per year.
I have used 1:m merge, and it looks like its merged. I then looked at the CEO GENDER, which is main part of my study, and deleted the rows which did not match.
I checked whether it removed rows of information, but it kept the information of Dataset 1 and moved/multiplied the information from the 'company financials years' to CEO financials. Like this: [merged]
Year....CEO FIN....Company FIN
X1......X1-Male............X1
X1......X1-Male............X1
X1......X1-Female............X1
Now it's of course counting all the years multiple times and therefore the sample size looks deceitful, it's giving something like 293,123 as sample size only because the years are counted more than once.
Is there a way to shorten the rows when using it in calculations? And secondly, I thought to only keep 1 row per year of the information when there is a Female CEO that year (if more, I summarize) I'll use only X1-Female, and otherwise I'll use Male (X1-Male). So that per company year there will be one row. But am kinda clueless how to perform such task.
Does anyone know how to do this, or even if this is possible. Thanks.
Related Posts with dataset is showing
Need help with codingHi I have an employee dataset of the following form: Employee_Code Office_Code Year Duration Pres…
Moderation with TestparmHello, I am running a moderation analysis looking to see if Race (4 categories) moderates the relat…
Autocorrelation heteroskadasicityHi guys, I have a panel data set with 2Ts 500N and 2Ts 30N I use OLS and Fixed effects estimations I…
Error Setting/Reshaping a Multiple Imputation DatasetHi everyone, I have several (potentially stupid) questions about multiple imputation. I've tried rea…
How to Test The Parallel Trends Assumption Of Difference in Differences EstimationHello All, I apologize for this question, as I know that it has been asked before but I am unable t…
Subscribe to:
Post Comments (Atom)
0 Response to dataset is showing
Post a Comment