Hi,
I'd like to identify observations when a string variable contains multiple substrings.
In my dataset, I have a string variable containing crime descriptions. For that string variable, I want to identify when "REG" and "GUN" appear together in the same cell.
The data is messy and not standardized. Here is an example of how some of the strings look:
crime
GUN OFFENDER REGISTRATION
GUN OFFENDER-FAIL TO REGISTER
GUN OFFENDER/FAIL REG OFFENDER
GAS/AIR/PAINTBALL GUN: POSSESS
FIRING HANDGUN IN CITY LIMITS
FRAUDULENT POSSESSION OF VEH OWNERSHIP REG. PLATE
KNOWINGLY HOLDING FALSIFIED VEH. REG. PLATE
I've successfully used the strpos command to isolate observations containing a single substring, i.e. :
l if strpos(crime, "REG")
l if strpos(crime, "GUN")
And I've been able to identify observations that contain either one substring or another, i.e. :
l if strpos(crime, "REG" "GUN")
But I haven't been able to figure out how to identify if a single cell contains both "REG" and "GUN".
Any advice is appreciated.
Related Posts with Identify if single cell contains multiple substrings
How to keep >2 observations over 2-yearsI wonder if anyone would be so kind as to help me with the appropriate syntax. I have a data set de…
Matched dataset based on multiple variablesHello, I am attempting to create a dataset of matched observations based on multiple criteria. I h…
Monthly observations and quarterly fixed effectsHi everyone, Hope you are doing well and maybe you have some handful insights for the following. M…
getting 95% CIs for multinomial variable, with clusteringHi all, I'd like to obtain 95% CIs for a variable with 3 categories - male, female, and unknown; ove…
Bar Graph with multiple dummy variables on x-AxisHi, I am trying to create a bar graph in Stata, that describes a mean of one variable (v1) for seve…
Subscribe to:
Post Comments (Atom)
0 Response to Identify if single cell contains multiple substrings
Post a Comment