I am using Robert Picard's excellent project package. I would like to keep track of the number of observations dropped while cleaning and merging my data, along with the reasons.
I know how to do the counts, but my question is how to keep track and generate a final tally.
It can be written to the logs, but then I have to manually go and extract them from the logs.
If I append them to a separate file (e.g. a text file), I will have an issue when one file that drops observations is edited and another file is not, since the non-edited file may not be re-run (if nothing it depends on is changed, which is managed by project), and the counts will get lost.
I was thinking along the line of a .dta file with a variable for each of the different drop reasons, so that the counts can each be changed individually, but this seems like a very "hacky" solution. Also, the project package doesn't seem to have a way to handle a file that will be changed by more than one do-file.
I would love to hear any suggestions.
Thank you!
Related Posts with Tracking dropped observation counts
Instrumental Variable, combining datasets Hello, I am using an instrumental variable for trade openness to analyze the effect of trade openn…
Expand data based on a time variable & extract information in panel dataDear Statalisters, Below is my data (show the two five records), which includes gender, begin date …
Adjusting the dataset to create a Panel Data AnalysisHello everybody, I have probably a rather simple question, but I still spent hours to solve it, so …
How do I choose which dummy variable to omit when I have dummies in the form i.variableHello, I have split up my independent variable of interest into quantiles where the first quantile …
Quotes do not match error in putexcel commandHello, I'm having terrible trouble with quotes supposedly not matching in a putexcel command. This i…
Subscribe to:
Post Comments (Atom)
0 Response to Tracking dropped observation counts
Post a Comment