Hi,
I'd like to identify observations in which a string variable contains multiple substrings.
In my dataset, I have a string variable containing crime descriptions. For that string variable, I want to identify when "REG" and "GUN" appear together in the same cell.
The data is messy and not standardized. Here is an example of how some of the strings look:
crime
GUN OFFENDER REGISTRATION
GUN OFFENDER-FAIL TO REGISTER
GUN OFFENDER/FAIL REG OFFENDER
GAS/AIR/PAINTBALL GUN: POSSESS
FIRING HANDGUN IN CITY LIMITS
FRAUDULENT POSSESSION OF VEH OWNERSHIP REG. PLATE
KNOWINGLY HOLDING FALSIFIED VEH. REG. PLATE
I've successfully used the strpos() function to isolate observations containing a single substring, e.g.:
l if strpos(crime, "REG")
l if strpos(crime, "GUN")
And I've been able to identify observations that contain either one substring or the other, e.g.:
l if strpos(crime, "REG") | strpos(crime, "GUN")
But I haven't been able to figure out how to identify if a single cell contains both "REG" and "GUN".
Any advice is appreciated.
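One possible approach, offered as a sketch rather than a definitive answer: strpos() returns the position of the first match and 0 when there is no match, so two calls can be joined with the logical "and" operator (&). The block below rebuilds the sample rows from this post so that it runs on its own; the indicator name "both" and the str60 width are assumptions made for illustration.

```stata
* Recreate the example rows from the post (str60 is an assumed width)
clear
input str60 crime
"GUN OFFENDER REGISTRATION"
"GUN OFFENDER-FAIL TO REGISTER"
"GUN OFFENDER/FAIL REG OFFENDER"
"GAS/AIR/PAINTBALL GUN: POSSESS"
"FIRING HANDGUN IN CITY LIMITS"
"FRAUDULENT POSSESSION OF VEH OWNERSHIP REG. PLATE"
"KNOWINGLY HOLDING FALSIFIED VEH. REG. PLATE"
end

* strpos() is 0 when the substring is absent, so "> 0" means "present".
* "both" (hypothetical name) is 1 only when REG and GUN both occur.
generate byte both = strpos(crime, "REG") > 0 & strpos(crime, "GUN") > 0

* List the flagged observations
list crime if both
```

On the sample rows this should flag only the first three descriptions. Because the match is on raw substrings, "REG" also matches inside "REGISTRATION" and "GUN" inside "HANDGUN", which may or may not be what is wanted with data this messy. The same condition can be used directly without creating a variable, as in l if strpos(crime, "REG") & strpos(crime, "GUN").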