Hi,
I'd like to identify observations when a string variable contains multiple substrings.
In my dataset, I have a string variable containing crime descriptions. For that string variable, I want to identify when "REG" and "GUN" appear together in the same cell.
The data is messy and not standardized. Here is an example of how some of the strings look:
crime
GUN OFFENDER REGISTRATION
GUN OFFENDER-FAIL TO REGISTER
GUN OFFENDER/FAIL REG OFFENDER
GAS/AIR/PAINTBALL GUN: POSSESS
FIRING HANDGUN IN CITY LIMITS
FRAUDULENT POSSESSION OF VEH OWNERSHIP REG. PLATE
KNOWINGLY HOLDING FALSIFIED VEH. REG. PLATE
I've successfully used the strpos command to isolate observations containing a single substring, i.e. :
l if strpos(crime, "REG")
l if strpos(crime, "GUN")
And I've been able to identify observations that contain either one substring or another, i.e. :
l if strpos(crime, "REG" "GUN")
But I haven't been able to figure out how to identify if a single cell contains both "REG" and "GUN".
Any advice is appreciated.
Related Posts with Identify if single cell contains multiple substrings
Dropping observation under conditionsDear expers, my data: Code: * Example generated by -dataex-. To install: ssc install dataex clear …
density plothello. I am not familiar with the density plot and I have a simple question. I have information abou…
Drop variable if varlabel contains certain expressionHey Statalisters, I am trying to drop all variables that have the expression "Domestic Currency" in…
Nested loop over observations and varlistHello everyone! I am trying to do something which, upon reading manuals and forum entries, seems per…
How to treat differing likert scalesHi all, I am using panel data from British Election Survey to perform a fixed effect OLS regression …
Subscribe to:
Post Comments (Atom)
0 Response to Identify if single cell contains multiple substrings
Post a Comment