I have a large dataset in which each row has an identifier 'id' and a time variable 'timeto'. The number of duplicates per id varies, but every id in the dataset is duplicated at one point or another. I would like to create a flag indicating when two rows with the same id have 'timeto' values within 2 of each other (a difference <= 2). If an id has more than one such pair, I'd like the flag to count up (1 for the first pair, 2 for the second, and so on). Examples below:
starting data:
id timeto
1 3
2 7
2 8
2 0
2 14
2 15
3 4
3 6
ending data:
id timeto flag
1 3 0
2 7 1
2 8 1
2 0 0
2 14 2
2 15 2
3 4 1
3 6 1
Eventually, for those 'pairs' of rows with the same id and 'timeto' values within 2 of each other, I plan to pivot wide so that there is a single row for each id. Any help would be appreciated!
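One possible way to express the flagging rule, sketched in Python because the post does not name a tool. It rests on several assumptions that are not stated in the question: rows are (id, timeto) tuples, "within 2" means the difference between neighbouring sorted 'timeto' values is <= 2, values that chain together (e.g. 1, 3, 5) are treated as one group, and the counter restarts at 1 for each id and increases in order of 'timeto'. The function name `flag_close_rows` and the parameter `max_gap` are made up for illustration.

```python
from collections import defaultdict

def flag_close_rows(rows, max_gap=2):
    """Return one flag per input row, in the original row order."""
    # Collect (timeto, original position) for each id.
    by_id = defaultdict(list)
    for pos, (row_id, timeto) in enumerate(rows):
        by_id[row_id].append((timeto, pos))

    flags = [0] * len(rows)
    for entries in by_id.values():
        entries.sort()  # order by timeto within the id
        counter = 0     # assumption: numbering restarts for every id
        i = 0
        while i < len(entries):
            # Extend the run while neighbouring sorted values are within max_gap.
            j = i
            while j + 1 < len(entries) and entries[j + 1][0] - entries[j][0] <= max_gap:
                j += 1
            if j > i:  # a run of two or more rows counts as a match
                counter += 1
                for _, pos in entries[i:j + 1]:
                    flags[pos] = counter
            i = j + 1
    return flags

# The starting data from the question.
rows = [(1, 3), (2, 7), (2, 8), (2, 0), (2, 14), (2, 15), (3, 4), (3, 6)]
flags = flag_close_rows(rows)
for (row_id, timeto), flag in zip(rows, flags):
    print(row_id, timeto, flag)
```

On the example data this reproduces the flag column in the ending data. If chained values should not be merged into one group, the run-extension step would need a different rule, for instance comparing each value with the first value of the run instead of its immediate neighbour.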