hi Stata Gurus,
I am trying to check the outliners in my data. For example, the total asset (ta) of european banks in bank-quarter level
. list ta in 10/20
+----------+
| ta |
|----------|
10. | . |
11. | . |
12. | . |
13. | . |
14. | 1.27e+09 |
|----------|
15. | 1.30e+09 |
16. | 1.30e+09 |
17. | 1.31e+09 |
18. | 1.36e+09 |
19. | 1.39e+09 |
|----------|
20. | 1.41e+09 |
+----------+
. sum ta
Variable | Obs Mean Std. dev. Min Max
-------------+---------------------------------------------------------
ta | 2,748 1.62e+08 3.91e+08 152426 2.67e+09
. codebook ta
-------------------------------------------------------------------------------------------------------
ta (unlabeled)
-------------------------------------------------------------------------------------------------------
Type: Numeric (float)
Range: [152426.03,2.673e+09] Units: .01
Unique values: 2,744 Missing .: 6,300/9,048
Mean: 1.6e+08
Std. dev.: 3.9e+08
Percentiles: 10% 25% 50% 75% 90%
1.7e+06 5.4e+06 2.0e+07 7.6e+07 4.5e+08
I tried to detect the outliners by graphing boxplot and all of the observations which are NOT missing values are graphed as Outliners.
. graph box ta - seems to assign numeric value to missing observations and include them when graphing
https://ift.tt/32kNO72
I tried to tell Stata to graph only non-missing value but nothing really works
. graph box ta if ta >= 152426 *Graph only the observation with value starting from the min(ta)
. graph box ta if ! missing (ta)
All suggestion are appreciated,
Best
0 Response to Why does Stata graph box not ignore missing values _
Post a Comment