Hello all,
I am a new STATA user and have some questions about winsorizing.
For example, I want to winsorize variable a with 20 observations at 5% and 95% percentile: -40 -5 10 13 15 19 26 28 41 58 78 85 86 89 89 91 92 101 101 1053 (-40 and 1053 are outliers given 5% and 95% percentile)
Code winsor a, gen(a_w) p(0.05) gives me: -5 -5 10 13 15 19 26 28 41 58 78 85 86 89 89 91 92 101 101 101
and code winsor2 a, suffix(_w2) cuts(5 95) gives me: -22.5 -5 10 13 15 19 26 28 41 58 78 85 86 89 89 91 92 101 101 577
Base on my understanding, both codes should perform the same task. So why the results are different? Which one is correct?
Another more general question, if one wants to winsorize a string of data such as 1 2 3 4 ...98 99 100 at 1% and 99% percentile, what is the correct result? Should it be 2 2 3 4....98 99 99?
Han
Related Posts with winsor and winsor2, different results
How can I copy values for a variable to other observations within the same variable?Hello, I have a similar question as in https://www.statalist.org/forums/for...same-household which …
Fixed effects dummies in OLS regression vs FE estimator (xtreg, fe)This is a very basic question but after doing some research I could not find the answer to this. I k…
Fixed-effect panel threshold modelDear all im working on my PhD thesis and i'm using the fixed-effect threshold model i would ask yo…
Error last estimates not found.Hello I am trying to run a regression among 220 firms from 1998-2018 to predict a yearly beta value.…
Dependent variable lagsHello How to create a dependent variable lags in the data panel (in stata)? …
Subscribe to:
Post Comments (Atom)
0 Response to winsor and winsor2, different results
Post a Comment