I'm new to Stata and I would like to have some of your advice!
I have gathered hourly data of pollution emissions from 7 different monitoring stations in the city of Paris, France. I have 24 observations per day for each polluant, from each monitoring station, one per hour of the day, from 01/10/2009 to 01/10/2019. Time is my independent variable. My data set is like shown below:
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input double DateTime int(no2_pa13 pm25_aut) 1.5471648e+12 81 105 1.5471684e+12 86 99 1.547172e+12 87 91 1.5471756e+12 89 86 1.5471792e+12 87 86 1.5471828e+12 83 87 1.5471864e+12 87 89 1.54719e+12 94 95 1.5471936e+12 102 98 1.5471972e+12 114 97 1.5472008e+12 127 102 1.5472044e+12 145 114 1.547208e+12 166 126 1.5472116e+12 161 136 1.5472152e+12 167 141 1.5472188e+12 147 137 1.5472224e+12 119 131 1.547226e+12 99 129 1.5472296e+12 116 145 1.5472332e+12 128 162 1.5472368e+12 124 159 1.5472404e+12 126 143 1.547244e+12 133 143 1.5472476e+12 152 161 1.5472512e+12 131 169 1.5472548e+12 114 153 1.5472584e+12 110 140 1.547262e+12 105 138 1.5472656e+12 93 146 1.5472692e+12 88 150 1.5472728e+12 83 149 1.5472764e+12 75 137 1.54728e+12 67 127 1.5472836e+12 65 126 1.5472872e+12 78 128 1.5472908e+12 97 134 1.5472944e+12 104 129 1.547298e+12 99 127 1.5473016e+12 97 126 1.5473052e+12 104 123 1.5473088e+12 110 120 1.5473124e+12 116 113 1.547316e+12 116 112 1.5473196e+12 115 110 1.5473232e+12 117 97 1.5473268e+12 115 89 1.5473304e+12 104 78 1.547334e+12 93 79 1.5473376e+12 83 92 1.5473412e+12 75 91 1.5473448e+12 73 88 1.5473484e+12 69 73 1.547352e+12 68 67 1.5473556e+12 75 66 1.5473592e+12 83 67 1.5473628e+12 82 61 1.5473664e+12 94 59 1.54737e+12 . 58 1.5473736e+12 . 59 1.5473772e+12 72 57 1.5473808e+12 66 49 1.5473844e+12 65 45 1.547388e+12 73 43 1.5473916e+12 81 39 1.5473952e+12 76 36 1.5473988e+12 78 34 1.5474024e+12 77 31 1.547406e+12 77 28 1.5474096e+12 72 27 1.5474132e+12 56 24 1.5474168e+12 39 22 1.5474204e+12 32 19 1.547424e+12 32 16 1.5474276e+12 35 12 1.5474312e+12 40 13 1.5474348e+12 40 15 1.5474384e+12 45 17 1.547442e+12 69 17 1.5474456e+12 83 25 1.5474492e+12 95 28 1.5474528e+12 81 26 1.5474564e+12 77 26 1.54746e+12 67 29 1.5474636e+12 65 31 1.5474672e+12 63 36 1.5474708e+12 67 38 1.5474744e+12 71 40 1.547478e+12 75 42 1.5474816e+12 81 33 1.5474852e+12 83 30 1.5474888e+12 77 32 1.5474924e+12 58 31 1.547496e+12 56 34 1.5474996e+12 53 36 1.5475032e+12 49 36 1.5475068e+12 54 34 1.5475104e+12 40 28 1.547514e+12 23 22 1.5475176e+12 27 19 1.5475212e+12 30 16 end format %tcMonth_dd,_CCYY_HH:MM:SS DateTime
For instance, no2 is the name of the pollutant, pa13 is the monitoring station's name.
From that, I would like to create two new variables. I would like to have average daily pollution levels, for each polluant, across all 7 stations, between 01/10/2009 and 01/10/2019. I would also like to have pollution levels across the hours of the day, for each pollutant, across all 7 stations. How can I create these variables from my data set?
Thanks for your help!
Guillaume
0 Response to Creating daily average and hourly levels from hourly data
Post a Comment