Hello,
I wanted to see the distribution of firms in the various industry areas (based on the first 2 digits of SIC codes). Here is what I've written and the error obtained:
// 1. SIC (2 digits) classification:
// we create a string variable (sic2) with the first 2 digits of sic in order to indentify the industry
gen sic2=substr(sic,1,2)
// we encode the variable so that we obtain a numeric variable instead of a string one
encode sic2, generate (SIC2)
// next we generate variable SIC_group to group the various sic into the appropriate industry
recode SIC2 (01/09=0) (10/14=1) (15/17=2) (20/39=3) (40/49=4) (50/51=5) (52/59=6) (60/67=7) (70/89=8) (91/97=9) (else=.), generate(SIC_group)
Unfortunately, when I check the variable SIC_groups, I obtain that firms with SIC 7363, so with a SIC2 of 73, are placed in group 6, which is wrong. The error occurs also in other groups and sic.
Also, do you know which command can I use later to see the number of GVKEY per industry group?
Has someone any idea what to do?
Thank you in advance
Related Posts with Distribution of firms by industry (based on SIC codes)
Using indented foreach and forvaluesDear all, I am trying for the first time to use foreach and forvalues, but with little success. The …
Comparing rates in stataHello, I am new to STATA and am using Version 15. Due to the confidential nature of my data I canno…
calculating Growth rates with a gap of 3 years in a panel dataDear statlist, I am working on a panel data set and need to calculate growth rates of each panel ID…
Out-of-sample forecastingHello, I'm trying to forecast an arima(1,0,1) model with 4 variables out of sample. The dataset run…
Graphical analysis of ratesHello, I am new to STATA and am using Version 15. Due to the confidential nature of my data I canno…
Subscribe to:
Post Comments (Atom)
0 Response to Distribution of firms by industry (based on SIC codes)
Post a Comment