Hi all, I have a very large dataset of 970,000 observations, this dataset was given to be an organisation.
I tried to merge this dataset with another which came back with the error
stata does not uniquely identify observations in the master data
Which I figured it it has to do with my ID variable. I checked for any missing in both the master and merge file which there are none.
I then checked for duplicates as I figured out this would be the only other reason. (Although in none of my code have I myself introduced any duplicates)
I tried duplicates report
Array
I then tried to list the duplicates of course there were too many.
I then tried codebook - as you can see the unique values here differ.
Array
My question: Why does codebook show different number of unique values to the duplicates report which shows there are 959,798 unique values.
Related Posts with Duplicates report vs Codebook
Using Local in a LoopHello everyone, I'm having a problem commanding a local to do a function. The idea of the whole loo…
Beginner's question on datasetI have data on online competitions that are held weekly. I have data from 2001 all the way up to 201…
Standard DeviationIn my panel data, There are 20 individuals and 32-time points. Within the data set, the "Exchange ra…
Beginner question, loop command for net returns, error code r(198) "`var' invalid name"Hi everyone, I have a time series set of monthly currency spot rates from multiple countries. I als…
Marginplot and Khb graphDear statalists, It is my first post on this forum, so i hope that i won't wrong section or formulat…
Subscribe to:
Post Comments (Atom)
0 Response to Duplicates report vs Codebook
Post a Comment