I am trying to merge two huge datasets.
To do so, I am generating a unique identifier as
Code:
gen id_mas = _n
For instance:
Array
as you can see the number displayed is 2.33e+07 but it is precisely 23309572. The number below is displayed again as 2.33e+07 but it's 23309514. So they are uniquely defined but stata seems to care only about the rounded value.
How can I solve this issue and tell stata that these are two separate numbers?
Thank you
0 Response to Duplicates in very large dataset
Post a Comment