Hello all,
I am a new STATA user and have some questions about winsorizing.
For example, I want to winsorize variable a with 20 observations at 5% and 95% percentile: -40 -5 10 13 15 19 26 28 41 58 78 85 86 89 89 91 92 101 101 1053 (-40 and 1053 are outliers given 5% and 95% percentile)
Code winsor a, gen(a_w) p(0.05) gives me: -5 -5 10 13 15 19 26 28 41 58 78 85 86 89 89 91 92 101 101 101
and code winsor2 a, suffix(_w2) cuts(5 95) gives me: -22.5 -5 10 13 15 19 26 28 41 58 78 85 86 89 89 91 92 101 101 577
Base on my understanding, both codes should perform the same task. So why the results are different? Which one is correct?
Another more general question, if one wants to winsorize a string of data such as 1 2 3 4 ...98 99 100 at 1% and 99% percentile, what is the correct result? Should it be 2 2 3 4....98 99 99?
Han
Related Posts with winsor and winsor2, different results
Two stage Program: First stage is OLS and second is mlogit, error in calculating bootstrap standard errorsStata Code: program my2sls regress son_yrs_schooling ib(first).age_cohort father_yrs_schooling ib(l…
mixed Command; Using Random Intercept as a Control variablHi. I am using mixed modeling for my multilevel data across countries, industries, and firms. I need…
Unsignificant resultsMy name is Muhammad Rashid student of MS BA I have panel data for 10 years of 1600 companies when I …
xtdpdgmm command - @SebastianKripfganzDear Dr. Kripfganz, Following your suggestions in the previous posts I decided to use xtdpdgmm comm…
Balanced panel?Hello everyone, I have a very simple question that I can't seem to work out an answer to. That is w…
Subscribe to:
Post Comments (Atom)
0 Response to winsor and winsor2, different results
Post a Comment