I have a very large yearly database on students at different educational levels (3 million observations per month).
I need to append all the months together to form a balanced panel. In month 1 (t=1), the database looks like this:
Id | Year | Educational Level |
100 | 1 | 1 |
101 | 1 | 1 |
102 | 1 | 1 |
103 | 1 | 2 |
104 | 1 | 2 |
100 | 2 | 2 |
101 | 2 | 2 |
102 | 2 | 2 |
103 | 2 | 3 |
I want to use an append command that appends only the subset of IDs that had an educational level equal to 1 in the baseline years. In the example above, I want to append the people with Id 100 through 102, but not 103 and 104.
This is because the database is very large, so I need to do the append with only the essential observations, instead of merging everything and then dropping the observations I don't need.
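To make the idea concrete, here is a minimal sketch of "filter while appending" in plain Python rather than a statistics package's append command. It assumes each period is available as CSV text with columns named id, year and level; those column names, the function names and the toy data are all hypothetical and only mirror the example table above. The baseline period is read once to collect the IDs with level 1, and every period is then streamed row by row so that only matching rows are ever written.

```python
import csv
import io


def baseline_ids(baseline_rows, level="1"):
    """Collect the ids whose educational level equals `level` at baseline."""
    return {row["id"] for row in baseline_rows if row["level"] == level}


def append_subset(period_readers, keep_ids, writer):
    """Stream each period and write only the rows whose id is in keep_ids."""
    kept = 0
    for rows in period_readers:
        for row in rows:
            if row["id"] in keep_ids:
                writer.writerow(row)
                kept += 1
    return kept


# Toy data copied from the example table (hypothetical column names).
period1 = "id,year,level\n100,1,1\n101,1,1\n102,1,1\n103,1,2\n104,1,2\n"
period2 = "id,year,level\n100,2,2\n101,2,2\n102,2,2\n103,2,3\n"

# Step 1: read the baseline period once and keep only the qualifying ids.
keep_ids = baseline_ids(csv.DictReader(io.StringIO(period1)))

# Step 2: append every period, filtering on the fly.
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=["id", "year", "level"])
writer.writeheader()
n_kept = append_subset(
    (csv.DictReader(io.StringIO(text)) for text in (period1, period2)),
    keep_ids,
    writer,
)

# Read the combined panel back for inspection.
panel_rows = list(csv.DictReader(io.StringIO(out.getvalue())))
```

With real files, the io.StringIO objects would be replaced by files opened with open(path, newline=""). Only the set of baseline IDs is held in memory, so the full data for each period is never loaded at once.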
Any help with this?
Regards,
Alejandro