Hello,
I am trying to clean up a large dataset of addresses, where there are several different versions of the same address (e.g. '101 Main St' can also be listed as '101 Main Street') but they are associated with the same individual. I want to create a consolidated dataset with consistent addresses.
There are some addresses that I need to drop because they are not actual street addresses - e.g. '*103', '*Dept 164'. Many of these erroneous addresses begin with an asterisk, others with alternative symbols.
Is there a way to selectively drop those with symbology or do I need to do something like 'split address, generate (new)' and proceed from there?
Thank you!
Related Posts with Dropping specific addresses beginning with '#' or '$'
estimating targets using DEAHello, I am looking to estimate target output values for inefficient DMUs using an output oriented …
BOX PLOT WITH INDIVIDUAL OBSERVATION--How to writeHi to everybody. I have to do a boxplot with individual observations using the dataset below: clea…
Merging us patentsview with IPC codes, multiple ipc codes for one patent_idI want to calculate the radicalness of patents, so I am working with the backward citation. To calcu…
ivreg2h warningHi, I am trying to run an IV regression with internal instruments only (as per Lewbel 2012). My equ…
Bayestest model not showing posterior probabilitiesDear Stata Users community, I am trying to compare the posterior probabilities of a series of ologi…
Subscribe to:
Post Comments (Atom)
0 Response to Dropping specific addresses beginning with '#' or '$'
Post a Comment