Dear Stata community,
I am new to the forum and to Stata, too. I am trying to calculate the HHI (Herfindahl-Hirschman Index) in my panel data set in STATA 16.1. To give you an idea how my data structure with the relevant variables looks like, please see the following example values and variables:
The variables for the firms is gvkey, sale is the net sales of the firm, sic is the 4 digit SIC code and fyear is the fiscal year.
[CODE]
* Example generated by -dataex-. To install: ssc install dataex
clear
input str6 gvkey double fyear str16 sic double sale
"001001" 1984 "5812" 32.007
"001001" 1985 "5812" 53.798
"001003" 1983 "5712" 13.793
"001003" 1984 "5712" 13.829
"001003" 1986 "5712" 36.308
"001003" 1987 "5712" 37.356
"001003" 1988 "5712" 32.808
"001004" 1980 "5080" 132.482
"001004" 1981 "5080" 175.924
"001004" 1982 "5080" 155.006
"001004" 1983 "5080" 177.762
"001004" 1984 "5080" 218.946
"001004" 1985 "5080" 248.012
"001004" 1986 "5080" 298.192
"001004" 1987 "5080" 347.64
"001004" 1988 "5080" 406.36
"001004" 1989 "5080" 444.875
"001004" 1990 "5080" 466.542
"001004" 1991 "5080" 422.657
"001004" 1992 "5080" 382.78
"001004" 1993 "5080" 407.754
"001005" 1980 "3724" 23.382
"001005" 1981 "3724" 35.921
"001007" 1980 "3652" 9.262
"001007" 1981 "3652" 7.261
"001007" 1982 "3652" 4.993
"001007" 1983 "3652" 3.839
"001008" 1985 "3577" .705
"001009" 1982 "3460" 36.01
"001009" 1983 "3460" 18.753
"001009" 1984 "3460" 21.019
"001009" 1985 "3460" 20.507
"001009" 1986 "3460" 19.266
"001009" 1987 "3460" 19.55
"001009" 1988 "3460" 28.419
I started with the following code:
ssc install hhi
hhi sale, by (sic fyear)
sort sic fyear
However, I got the error „Negative values in varlist“. Therefore I changed the code to:
ssc install hhi
replace sale = . if sale < 0
drop if sale == .
hhi sale, by (sic fyear)
sort sic fyear
The code runs and I get the following result:
[CODE]
* Example generated by -dataex-. To install: ssc install dataex
clear
input str6 gvkey double fyear str16 sic double sale float hhi_sale
"008596" 1980 "0100" 405.884 .440715
"009391" 1980 "0100" 5.073 .440715
"010390" 1980 "0100" 25.022 .440715
"010884" 1980 "0100" 3762.579 .440715
"002099" 1980 "0100" 87.565 .440715
"001266" 1980 "0100" 12.517 .440715
"010971" 1980 "0100" 165.766 .440715
"002812" 1980 "0100" 1733.501 .440715
"010802" 1980 "0100" 79.928 .440715
"010884" 1981 "0100" 4058.385 .6711329
"010971" 1981 "0100" 247.284 .6711329
"009391" 1981 "0100" 5.478 .6711329
"010802" 1981 "0100" 76.542 .6711329
"001266" 1981 "0100" 12.346 .6711329
"002099" 1981 "0100" 100.759 .6711329
"010390" 1981 "0100" 20.977 .6711329
"008596" 1981 "0100" 477.995 .6711329
"001266" 1982 "0100" 9.955 .4613737
"002099" 1982 "0100" 110.442 .4613737
"010802" 1982 "0100" 78.755 .4613737
"002812" 1982 "0100" 1823.232 .4613737
"010390" 1982 "0100" 20.854 .4613737
"008596" 1982 "0100" 557.398 .4613737
"009391" 1982 "0100" 5.596 .4613737
"009062" 1982 "0100" 4.527 .4613737
"010971" 1982 "0100" 222.384 .4613737
"010390" 1983 "0100" 23.553 .4112865
"010884" 1983 "0100" 3360.441 .4112865
"006275" 1983 "0100" .424 .4112865
"008596" 1983 "0100" 505.434 .4112865
"001266" 1983 "0100" 8.877 .4112865
"009062" 1983 "0100" 4.496 .4112865
"002812" 1983 "0100" 1551.725 .4112865
"010971" 1983 "0100" 274.672 .4112865
"009391" 1983 "0100" 2.908 .4112865
"002099" 1983 "0100" 111.037 .4112865
"003702" 1984 "0100" 1.731 .40012285
"010390" 1984 "0100" 22.14 .40012285
"010971" 1984 "0100" 258.357 .40012285
"002812" 1984 "0100" 1520.088 .40012285
I am really unsure if my code is correct and if I really get the result I am looking for. Could someone of you have a look and check if the procedure is right?
Why do I write the code hhi sale, by (sic fyear) and not only hhi sale, by (sic)? Meaning why do I include the fyear in my HHI? And what does my result including the fyear is telling me now?
I would really appreciate your help as I am quite lost.
Thank you already in advance.
Shakira
Related Posts with Problem with calculating the HHI (Herfindahl-Hirschman Index)
Interaction effect outliers?I am very new to STATA and doing research. Something I do not quite understand. I am looking into a…
Update to -survsim- on SSC: Simulating time-to-event data from custom distributions, competing risk models and general multi-state modelsThanks to Kit Baum, the -survsim- package has been updated on SSC. This is a complete re-write, with…
Margins after MixedHey all, I'm running a 2-level model using the - mixed - command. Can I use the regular - margins -…
Obtaining pooled OLS estimatesI'm trying to compare Oaxaca-Blinder decomposition results to OLS estimates regarding the male wage …
clustering robust standard errors in one wave datasetHello, I have a one wave dataset of 158 firms, is it useful to cluster the standard errors? If so, …
Subscribe to:
Post Comments (Atom)
0 Response to Problem with calculating the HHI (Herfindahl-Hirschman Index)
Post a Comment