I have a large dataset with rows that include a unique identifier 'id' and a unique time variable called 'timeto'. There are variable numbers of duplicate id's but all in the dataset are duplicated at one point or another. I would like to create a flag indicating when two rows with the same id have 'timeto' values that are <= 2. If there are more than one, I'd like to count up. Examples below:
starting data:
id timeto
1 3
2 7
2 8
2 0
2 14
2 15
3 4
3 6
ending data:
id timeto flag
1 3 0
2 7 1
2 8 1
2 0 0
2 14 2
2 15 2
3 4 1
3 6 1
Eventually, for those 'pairs' of rows with the same ID and timeto within 2, I'd be planning to pivot wide so there is a single row for each ID. Any help would be appreciated!
Related Posts with Identifying and flagging matches
Using stset for recurring event analysisHi all, Apologies if this is very simple. I'm trying to run a recurring-event survival analysis us…
Hausman TestHello, This is my first post so forgive me if I explain anything wrong. I am currently running a pan…
Recentered Influence Functions (RIF) in Stata: RIF-Regression and RIF-DecompositionDear all, Thanks to Prof Baum, a new update to the "oaxaca_rif" command is now available. This new u…
Combining CMP and Finite Mixture Models (FMM)Dear all, I estimate a recursive system of equations using cmp command and like to adress unbserved…
egen newvar = group(oldvar), labelHi. I want to assign values to string data. Therefore I tried to convert the string var into a nume…
Subscribe to:
Post Comments (Atom)
0 Response to Identifying and flagging matches
Post a Comment