I have the following 3 dimensions’ multiple variable region-level panel data. This dataset consists of 267 variable series covering 548 regions and 19 year: variable ID, denoted as sid = (var1, var2, var3, …, var267), the region id, denoted as bps_id =(1, 2, 3, …, 548), and year, denoted as year =(2000, 2001, 2002, …, 2018).
The data structure is described as.
sid bps_id yr2000 yr2001 … yr2018 var1 1 XXX XXX … XXX var1 2 XXX XXX … XXX var1 3 XXX XXX … XXX ⸽ ⸽ ⸽ ⸽ ⸽ ⸽ var1 548 XXX XXX … XXX var2 1 XXX XXX … XXX var2 2 XXX XXX … XXX var2 3 XXX XXX … XXX ⸽ ⸽ ⸽ ⸽ ⸽ ⸽ |
bps_id year var1 var2 … var267 1 2000 XXX XXX … XXX 2 2000 XXX XXX … XXX ⸽ ⸽ ⸽ ⸽ ⸽ 548 2000 XXX XXX … XXX 1 2001 XXX XXX … XXX 2 2001 XXX XXX … XXX ⸽ ⸽ ⸽ ⸽ ⸽ 548 2001 XXX XXX … XXX |
reshape long yr var, i(sid bps_id) j(year)
However, the results is not what I expected as.
Data wide -> long
Number of obs. 146316 -> 2.8e+06
Number of variables 43 -> 27
j variable (19 values) -> year
xij variables:
yr2000 yr2001 ... yr2018 -> yr
var2000 var2001 ... var2018 -> var
I use the I use STATA version 16.
Would you help me with your expertise?
0 Response to Reshaping the multiple variable region-level panel data from the wide to the long-shape format
Post a Comment