Hello dear Statalist,
I am quite new with Stata and I could really need help for my thesis.
I have variables called "cusip" (firm code), "mgrno" (numerical code for investor (unique)), "rdate" (date), "typecode" (classification coding from 1-5 based on investor type), "shares" (number of shares held by each investor in a firm), and shrout2 (total shares for each firm in 1000).
However, some investors report their holdings multiple times a year (as can be seen from below), so if I would try to sum all the shares based on typecode for each firm, many would appear multiple times since they are reported quarterly. Not all however, so how can I do this? If I would like to get the total ownership for each type of investor for every firm for the latest date the ownership has been reported for example.
In the end, I would like to have a database of the ownership % (shares held by each type in each firm/total shares for firm) for each of the 5 investor types for each firm for that year.
Could someone help me?
When I tried dropping rdate duplicates based on mgrno and cusip, I lost necessary observations.
Array
Related Posts with Stata data manipulation filtering, summing and removing duplicates from quarterly observation data
Cannot install estout packageHello everyone, I have been experiencing some issues while trying to install the estout package on m…
encode with strange number?Dear All, I have this data set in Stata format (dataex may not be appropriate for my purpose), encod…
Creating a new variable (that changes value in every 3 days....)I am working with survey dataset. It has 30 people (variable name = dlp, numeric) who start work in …
Dealing with Highly Collinear Independent VariablesDear Stata Members I have a panel data where my independent variables are highly COLLINEAR(Index1 t…
Trim and fill using metatrimI looked through other forum posts (post1, post2, post3, post4) and this question has been asked man…
Subscribe to:
Post Comments (Atom)
0 Response to Stata data manipulation filtering, summing and removing duplicates from quarterly observation data
Post a Comment