I have a large dataset with rows that include a unique identifier 'id' and a unique time variable called 'timeto'. There are variable numbers of duplicate id's but all in the dataset are duplicated at one point or another. I would like to create a flag indicating when two rows with the same id have 'timeto' values that are <= 2. If there are more than one, I'd like to count up. Examples below:
starting data:
id timeto
1 3
2 7
2 8
2 0
2 14
2 15
3 4
3 6
ending data:
id timeto flag
1 3 0
2 7 1
2 8 1
2 0 0
2 14 2
2 15 2
3 4 1
3 6 1
Eventually, for those 'pairs' of rows with the same ID and timeto within 2, I'd be planning to pivot wide so there is a single row for each ID. Any help would be appreciated!
Related Posts with Identifying and flagging matches
Replacing missing rows of a variableDear All I have a file with more than 1k observations and two variables. One of the variables Y is c…
Correct for Selection on Independent VariablesDear Statalists, I am confused about how to correct for selection on one independent variable. I wa…
Exporting Logistic Regression output table - command using 'svyset'Dear experts, I am running logistic regression model using 'svyset' command in Stata 15. I am unable…
Panel VECM in STATA I am giving a general description of the model in a panel setting: Let y, a, b, and z are four vari…
How to sort data for distinct IDs with multiple visits and multiple valuesDear all, I have a dataset that has multiple visits for each person. However each person had a diff…
Subscribe to:
Post Comments (Atom)
0 Response to Identifying and flagging matches
Post a Comment