Dear Forum-Users,
I have a data set in long format – combining prescription data and dates of visits which looks like this – except for the „newvar“ – which I would like to obtain.
| patID | 
docID | 
prescDATE | 
visitDATE | 
newvar | 
 
| 1XY | 
A | 
10jan2018 | 
. | 
2 | 
 
| 1XY | 
B | 
10mar2018 | 
. | 
0 | 
 
| 1XY | 
A | 
. | 
09jan2018 | 
. | 
 
| 1XY | 
A | 
. | 
10jan2018 | 
. | 
 
| 1XY | 
B | 
. | 
15may2018 | 
. | 
 
| 1XY | 
C | 
. | 
13mar2018 | 
. | 
 
| 2GH | 
A | 
20jan2018 | 
. | 
1 | 
 
| 2GH | 
C | 
25mar2018 | 
. | 
0 | 
 
| 2GH | 
A | 
. | 
18jan2018 | 
. | 
 
| 2GH | 
C | 
. | 
15mar2018 | 
0 | 
 
| 3WE | 
A | 
02jan2018 | 
. | 
1 | 
 
| 3WE | 
C | 
05mar2018 | 
. | 
0 | 
 
| 3WE | 
A | 
 | 
03jan2018 | 
. | 
 
| 3WE | 
B | 
 | 
07mar2018 | 
. | 
 
 
 
 | 
 
What I want to do is „check“ whether each patient (patID) has been - for each prescription (prescDATE) - to the doctor who issued the prescription (docID) within a certain timeframe (lets say five days before or after the prescription date). If this is true, the "newvar" should count the visits within that timeframe.
I guess one solution would be to convert the dataset in the wide format and loop through the prescription dates?
I m looking for a clue how this could be handled in the long format?
Many Thanx, Lynde
 
0 Response to Calculating timeframes between variables in long format
Post a Comment