Dear Stata community,
I am new to the forum and to Stata, too. I am trying to calculate the HHI (Herfindahl-Hirschman Index) in my panel data set in STATA 16.1. To give you an idea how my data structure with the relevant variables looks like, please see the following example values and variables:
The variables for the firms is gvkey, sale is the net sales of the firm, sic is the 4 digit SIC code and fyear is the fiscal year.
[CODE]
* Example generated by -dataex-. To install: ssc install dataex
clear
input str6 gvkey double fyear str16 sic double sale
"001001" 1984 "5812" 32.007
"001001" 1985 "5812" 53.798
"001003" 1983 "5712" 13.793
"001003" 1984 "5712" 13.829
"001003" 1986 "5712" 36.308
"001003" 1987 "5712" 37.356
"001003" 1988 "5712" 32.808
"001004" 1980 "5080" 132.482
"001004" 1981 "5080" 175.924
"001004" 1982 "5080" 155.006
"001004" 1983 "5080" 177.762
"001004" 1984 "5080" 218.946
"001004" 1985 "5080" 248.012
"001004" 1986 "5080" 298.192
"001004" 1987 "5080" 347.64
"001004" 1988 "5080" 406.36
"001004" 1989 "5080" 444.875
"001004" 1990 "5080" 466.542
"001004" 1991 "5080" 422.657
"001004" 1992 "5080" 382.78
"001004" 1993 "5080" 407.754
"001005" 1980 "3724" 23.382
"001005" 1981 "3724" 35.921
"001007" 1980 "3652" 9.262
"001007" 1981 "3652" 7.261
"001007" 1982 "3652" 4.993
"001007" 1983 "3652" 3.839
"001008" 1985 "3577" .705
"001009" 1982 "3460" 36.01
"001009" 1983 "3460" 18.753
"001009" 1984 "3460" 21.019
"001009" 1985 "3460" 20.507
"001009" 1986 "3460" 19.266
"001009" 1987 "3460" 19.55
"001009" 1988 "3460" 28.419
I started with the following code:
ssc install hhi
hhi sale, by (sic fyear)
sort sic fyear
However, I got the error „Negative values in varlist“. Therefore I changed the code to:
ssc install hhi
replace sale = . if sale < 0
drop if sale == .
hhi sale, by (sic fyear)
sort sic fyear
The code runs and I get the following result:
[CODE]
* Example generated by -dataex-. To install: ssc install dataex
clear
input str6 gvkey double fyear str16 sic double sale float hhi_sale
"008596" 1980 "0100" 405.884 .440715
"009391" 1980 "0100" 5.073 .440715
"010390" 1980 "0100" 25.022 .440715
"010884" 1980 "0100" 3762.579 .440715
"002099" 1980 "0100" 87.565 .440715
"001266" 1980 "0100" 12.517 .440715
"010971" 1980 "0100" 165.766 .440715
"002812" 1980 "0100" 1733.501 .440715
"010802" 1980 "0100" 79.928 .440715
"010884" 1981 "0100" 4058.385 .6711329
"010971" 1981 "0100" 247.284 .6711329
"009391" 1981 "0100" 5.478 .6711329
"010802" 1981 "0100" 76.542 .6711329
"001266" 1981 "0100" 12.346 .6711329
"002099" 1981 "0100" 100.759 .6711329
"010390" 1981 "0100" 20.977 .6711329
"008596" 1981 "0100" 477.995 .6711329
"001266" 1982 "0100" 9.955 .4613737
"002099" 1982 "0100" 110.442 .4613737
"010802" 1982 "0100" 78.755 .4613737
"002812" 1982 "0100" 1823.232 .4613737
"010390" 1982 "0100" 20.854 .4613737
"008596" 1982 "0100" 557.398 .4613737
"009391" 1982 "0100" 5.596 .4613737
"009062" 1982 "0100" 4.527 .4613737
"010971" 1982 "0100" 222.384 .4613737
"010390" 1983 "0100" 23.553 .4112865
"010884" 1983 "0100" 3360.441 .4112865
"006275" 1983 "0100" .424 .4112865
"008596" 1983 "0100" 505.434 .4112865
"001266" 1983 "0100" 8.877 .4112865
"009062" 1983 "0100" 4.496 .4112865
"002812" 1983 "0100" 1551.725 .4112865
"010971" 1983 "0100" 274.672 .4112865
"009391" 1983 "0100" 2.908 .4112865
"002099" 1983 "0100" 111.037 .4112865
"003702" 1984 "0100" 1.731 .40012285
"010390" 1984 "0100" 22.14 .40012285
"010971" 1984 "0100" 258.357 .40012285
"002812" 1984 "0100" 1520.088 .40012285
I am really unsure if my code is correct and if I really get the result I am looking for. Could someone of you have a look and check if the procedure is right?
Why do I write the code hhi sale, by (sic fyear) and not only hhi sale, by (sic)? Meaning why do I include the fyear in my HHI? And what does my result including the fyear is telling me now?
I would really appreciate your help as I am quite lost.
Thank you already in advance.
Shakira
Related Posts with Problem with calculating the HHI (Herfindahl-Hirschman Index)
define column width in xl() "open" depending on values (reals & strings)?Hi! How may I define the column width for each column depending on the maximum length of the charac…
Moderation effectHi Everyone, I am running a panel model using population averaged model. I have different hypothese…
Choice models -cmxtmixlogit- command for including random parameter for one of the alternatives (the base/status quo alternative)I am having trouble with the new stata 16 cmxtmixlogit command. I am trying to estimate a choice mo…
Estimation using only one out of many categorical variables without dropping the restDear members, My dataset contains 4 countries and I'd like to run a regression using only one of th…
Margin effects with multinomial logistic regressionHi dear Statalisters, I am running a mlogit model with interaction effects between categorical vari…
Subscribe to:
Post Comments (Atom)
0 Response to Problem with calculating the HHI (Herfindahl-Hirschman Index)
Post a Comment