I am using Stata 16 and need help extracting data. I would like to use data frames for this, but I am not sure how. Below is a subset of my dataset:
Code:
* Example generated by -dataex-. To install: ssc install dataex
clear
input float id str3 location str7 time long quarters float unemp_data byte pt_output float spell
1 "AUT" "1969-Q1" 1 2.28056 . 0
2 "AUT" "1969-Q2" 2 1.799027 . 0
3 "AUT" "1969-Q3" 3 1.735777 . 0
4 "AUT" "1969-Q4" 4 1.718443 . 0
5 "AUT" "1970-Q1" 5 1.538099 . 0
6 "AUT" "1970-Q2" 6 1.328224 . 0
7 "AUT" "1970-Q3" 7 1.198998 . 0
8 "AUT" "1970-Q4" 8 1.225661 . 0
9 "AUT" "1971-Q1" 9 1.173203 . 0
10 "AUT" "1971-Q2" 10 1.33824 . 0
11 "AUT" "1971-Q3" 11 1.042691 . 0
12 "AUT" "1971-Q4" 12 1.277456 . 0
13 "AUT" "1972-Q1" 13 1.15306 . 0
14 "AUT" "1972-Q2" 14 1.064005 . 0
15 "AUT" "1972-Q3" 15 1.357699 . 0
16 "AUT" "1972-Q4" 16 1.039917 . 0
17 "AUT" "1973-Q1" 17 1.035841 . 0
18 "AUT" "1973-Q2" 18 1.177305 . 0
19 "AUT" "1973-Q3" 19 1.108755 . 0
20 "AUT" "1973-Q4" 20 .9943526 . 0
21 "AUT" "1974-Q1" 21 1.26808 . 0
22 "AUT" "1974-Q2" 22 1.299121 . 0
23 "AUT" "1974-Q3" 23 1.308654 1 1
24 "AUT" "1974-Q4" 24 1.478947 . 1
25 "AUT" "1975-Q1" 25 1.452747 . 1
26 "AUT" "1975-Q2" 26 1.88046 2 1
27 "AUT" "1975-Q3" 27 1.734818 . 1
28 "AUT" "1975-Q4" 28 1.982104 . 1
29 "AUT" "1976-Q1" 29 1.854092 . 1
30 "AUT" "1976-Q2" 30 1.830328 . 1
31 "AUT" "1976-Q3" 31 1.804084 . 1
32 "AUT" "1976-Q4" 32 1.657884 . 1
33 "AUT" "1977-Q1" 33 1.556965 . 1
34 "AUT" "1977-Q2" 34 1.492262 . 1
35 "AUT" "1977-Q3" 35 1.743809 . 1
36 "AUT" "1977-Q4" 36 1.708798 . 1
37 "AUT" "1978-Q1" 37 1.78792 . 1
38 "AUT" "1978-Q2" 38 2.194976 . 1
39 "AUT" "1978-Q3" 39 2.050228 . 1
40 "AUT" "1978-Q4" 40 2.223532 . 1
41 "AUT" "1979-Q1" 41 2.196454 . 1
42 "AUT" "1979-Q2" 42 2.063658 . 1
43 "AUT" "1979-Q3" 43 2.00938 . 1
44 "AUT" "1979-Q4" 44 2.050331 . 1
45 "AUT" "1980-Q1" 45 1.729307 1 2
46 "AUT" "1980-Q2" 46 1.763711 . 2
47 "AUT" "1980-Q3" 47 2.052905 . 2
48 "AUT" "1980-Q4" 48 1.885693 . 2
49 "AUT" "1981-Q1" 49 2.130085 2 2
50 "AUT" "1981-Q2" 50 2.577704 . 2
51 "AUT" "1981-Q3" 51 2.53258 . 2
52 "AUT" "1981-Q4" 52 2.889663 . 2
53 "AUT" "1982-Q1" 53 2.902107 . 2
54 "AUT" "1982-Q2" 54 3.268415 1 3
55 "AUT" "1982-Q3" 55 3.767183 . 3
56 "AUT" "1982-Q4" 56 4.004116 2 3
57 "AUT" "1983-Q1" 57 4.186985 . 3
58 "AUT" "1983-Q2" 58 4.326129 . 3
59 "AUT" "1983-Q3" 59 4.280576 . 3
60 "AUT" "1983-Q4" 60 3.7363 1 4
61 "AUT" "1984-Q1" 61 3.937536 . 4
62 "AUT" "1984-Q2" 62 3.801948 2 4
63 "AUT" "1984-Q3" 63 3.670202 . 4
64 "AUT" "1984-Q4" 64 3.740511 . 4
65 "AUT" "1985-Q1" 65 3.584991 . 4
66 "AUT" "1985-Q2" 66 3.616743 . 4
67 "AUT" "1985-Q3" 67 3.645316 . 4
68 "AUT" "1985-Q4" 68 3.545668 . 4
69 "AUT" "1986-Q1" 69 3.312451 . 4
70 "AUT" "1986-Q2" 70 2.947661 . 4
71 "AUT" "1986-Q3" 71 2.836876 . 4
72 "AUT" "1986-Q4" 72 3.318925 . 4
73 "AUT" "1987-Q1" 73 3.932383 . 4
74 "AUT" "1987-Q2" 74 3.714838 . 4
75 "AUT" "1987-Q3" 75 3.986203 . 4
76 "AUT" "1987-Q4" 76 3.516049 . 4
77 "AUT" "1988-Q1" 77 3.595898 . 4
78 "AUT" "1988-Q2" 78 3.492004 . 4
79 "AUT" "1988-Q3" 79 3.606824 . 4
80 "AUT" "1988-Q4" 80 3.512885 . 4
81 "AUT" "1989-Q1" 81 3.068615 . 4
82 "AUT" "1989-Q2" 82 3.235731 . 4
83 "AUT" "1989-Q3" 83 2.959549 . 4
84 "AUT" "1989-Q4" 84 3.307182 . 4
85 "AUT" "1990-Q1" 85 3.153923 . 4
86 "AUT" "1990-Q2" 86 3.269969 . 4
87 "AUT" "1990-Q3" 87 3.412282 . 4
88 "AUT" "1990-Q4" 88 3.239165 . 4
89 "AUT" "1991-Q1" 89 3.383817 . 4
90 "AUT" "1991-Q2" 90 3.44205 . 4
91 "AUT" "1991-Q3" 91 3.58801 . 4
92 "AUT" "1991-Q4" 92 3.613474 . 4
93 "AUT" "1992-Q1" 93 3.392481 . 4
94 "AUT" "1992-Q2" 94 3.799317 . 4
95 "AUT" "1992-Q3" 95 3.598001 1 5
96 "AUT" "1992-Q4" 96 3.767504 . 5
97 "AUT" "1993-Q1" 97 3.833333 2 5
98 "AUT" "1993-Q2" 98 3.966667 . 5
99 "AUT" "1993-Q3" 99 4.033333 . 5
100 "AUT" "1993-Q4" 100 4 . 5
end
The variable "pt_output" marks recession dates: it equals 1 in the quarter a recession starts and 2 in the quarter the recession ends. I want to create several new columns, one per recession, each extracting values from the column "unemp_data". Each column should hold the value at the recession start date plus the 20 rows before and the 20 rows after it. For example, the first recession starts in 1974-Q3 (row 23), so I would need row 23 of "unemp_data", rows 3 to 22, and rows 24 to 43.
I am not sure whether data frames can do this task. Any help would be appreciated. Thanks!
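To make the request concrete, here is a minimal sketch of the extraction I have in mind, written with plain -generate- rather than frames (so it does not answer the frames part of the question). It assumes the -dataex- example above is already in memory, that it holds a single location, and that observations are in time order. The names row, episode, unemp_ep1, unemp_ep2, ... and the local macro w are my own placeholders, not existing variables.

```stata
* Sketch only: assumes the -dataex- example is in memory,
* one location (AUT), observations sorted in time order.
sort id
gen long row = _n

* Number the recession starts 1, 2, ... (pt_output == 1 marks a start)
gen long episode = sum(pt_output == 1) if pt_output == 1

* Hypothetical window half-width: 20 rows on either side of the start
local w = 20

* Count how many recession starts there are
summarize episode, meanonly
local n_ep = r(max)

forvalues k = 1/`n_ep' {
    * Row number of the k-th recession start
    summarize row if episode == `k', meanonly
    local start = r(min)

    * Copy unemp_data for rows within w rows of the start;
    * all other rows are left missing
    gen float unemp_ep`k' = unemp_data if inrange(row, `start' - `w', `start' + `w')
}
```

With the example data this should give one new column per recession start, each filled only inside its own window. Windows that run past the first or last observation are simply cut short, and data with several locations would need the same steps done separately by location.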