Hello,
I am trying to clean up a large dataset of addresses, where there are several different versions of the same address (e.g. '101 Main St' can also be listed as '101 Main Street') but they are associated with the same individual. I want to create a consolidated dataset with consistent addresses.
There are some addresses that I need to drop because they are not actual street addresses - e.g. '*103', '*Dept 164'. Many of these erroneous addresses begin with an asterisk, others with alternative symbols.
Is there a way to selectively drop those with symbology or do I need to do something like 'split address, generate (new)' and proceed from there?
Thank you!
Related Posts with Dropping specific addresses beginning with '#' or '$'
Significance of variablesHello, I have one doubt about the significance of an independent variables. In my database I have e…
Quintile graph as bar chartHi all, I'm new with stata and learning to visualize data in stata. I have some issue with bar graph…
Bootstrapping standard errors moves the p-value of a variable from 0.000 to 1.000Hi all, To overcome the generated regressor problem, I perform bootstrapping to correct the standar…
Whether paired variable reduction is difference between groupsI have two variables, triglyceride before and after treatment. So I run the paired t test to see if…
Converting 5-year interval to yearly panel dataDear Statalisters, I would like to insert the missing years in a large multi-country five-year inter…
Subscribe to:
Post Comments (Atom)
0 Response to Dropping specific addresses beginning with '#' or '$'
Post a Comment