Hello! I am looking at the results of a survey in which some respondents submitted a response prematurely and then later submitted a full response. I would like to drop the earlier response, but when I drop duplicates by name, it drops the later response instead (e.g. I have two observations from Joe Smith and I want to drop the response from 2 p.m. but Stata drops the response from 5 p.m. of the same day and keeps the 2 p.m. observation).
Since there aren't too many, I decided to do it manually, but my timestamp was automatically converted to type double, format %tc, so I'm having trouble figuring out how to use it in an if-statement. For example:
drop if (Timestamp == "05sep2019 15:51:41" & Name == "Joe Smith")
drop if (Timestamp == 05sep2019 15:51:41 & Name == "Joe Smith")
Both return a type mismatch error, and I don't know how to address this. Converting to an int or string lost data. Alternatively, if there's a better way to drop duplicates in a "Name" variable by the *earlier* datetime, I'd really appreciate any suggestions on that.
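For reference, here is a minimal sketch of the two approaches I am asking about, run on made-up data. The rows, the helper variable ts_str, and the times are hypothetical; only the variable names Name and Timestamp come from my dataset. As far as I understand, a %tc variable is a number (milliseconds since 01jan1960 stored as a double), so it has to be compared with a numeric datetime literal written with tc() rather than with a string or a bare date. The second part assumes Name uniquely identifies a respondent, and it sorts within Name so that only the latest response is kept.

```stata
* Toy data (hypothetical rows, for illustration only)
clear
input str20 Name str20 ts_str
"Joe Smith" "05sep2019 14:02:10"
"Joe Smith" "05sep2019 17:05:33"
"Ann Lee"   "05sep2019 09:30:00"
end

* Build a %tc datetime; it must be stored as double to keep full precision
generate double Timestamp = clock(ts_str, "DMYhms")
format Timestamp %tc
drop ts_str

* Approach 1: drop one specific observation by hand.
* tc() turns the literal into the numeric value that Timestamp holds.
preserve
drop if Timestamp == tc(05sep2019 14:02:10) & Name == "Joe Smith"
list
restore

* Approach 2: keep only the latest response for each name.
* Within each Name, observations are sorted by Timestamp,
* so the last one (_n == _N) is the most recent.
bysort Name (Timestamp): keep if _n == _N
list
```

If two different people could share a name, the second approach would need an additional identifier in the bysort list, so I would treat it as a starting point rather than a finished solution.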
Thank you!