Dear all,
I have a variable called place_birth in my dataset. Some of the locations weren't recorded properly.
place_birth
Feucherolles (Saint-James = Le château royal de Sainte-Gemme)
ST B(?), canton de Chaillot
(?Chanvrand)Canton de La Guiche
Seine-Inférieure (Seine-Maritime)
Épinay-sur-Seine ,
Autine (?) Outines
Darrois ? Darvois
I would like to do two things.
First, separate what is inside parentheses (), or set off by a comma or an = sign, from the rest of the text. I can then put the separated part into a new variable called place_new.
Second, clean both variables of stray characters such as ?, =, a trailing ., /, etc.
For example
Épinay-sur-Seine ,
should look like
Épinay-sur-Seine
Also, ? and (?) should be replaced with a comma, so that
Autine (?) Outines
becomes
Autine , Outines
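The two cleaning rules above can be sketched as regular-expression substitutions. This is a minimal illustration in Python rather than Stata, and the function name clean_place is hypothetical. It assumes that a "?" directly after an opening parenthesis, as in (?Chanvrand), should be left alone so the parenthesised text can be extracted later. The patterns are ordinary regular expressions, so they should port to a regex-capable replace function such as Stata's ustrregexra().

```python
import re

def clean_place(text: str) -> str:
    """Hypothetical helper: apply the cleaning rules from the examples above."""
    # Replace "(?)" or a standalone "?" (plus surrounding spaces) with " , ".
    # The lookbehind skips a "?" that directly follows "(", e.g. "(?Chanvrand)".
    text = re.sub(r"\s*\(\?\)\s*|\s*(?<!\()\?\s*", " , ", text)
    # Collapse runs of commas created by the step above ("B , , canton").
    text = re.sub(r",(\s*,)+", ",", text)
    # Drop stray separators and whitespace at the end and at the start.
    text = re.sub(r"[\s,.=/]+$", "", text)
    text = re.sub(r"^[\s,.=/]+", "", text)
    return text

for raw in ["Épinay-sur-Seine ,", "Autine (?) Outines", "Darrois ? Darvois"]:
    print(repr(clean_place(raw)))
```

The order matters: the "?" replacement runs first, so any comma it leaves at the end of a value is removed by the trailing-separator step.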
For this one:
Feucherolles (Saint-James = Le château royal de Sainte-Gemme)
I would like to eliminate "Saint-James =" and just leave:
Feucherolles (Le château royal de Sainte-Gemme)
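A possible sketch of this step, again in Python as an illustration only (drop_before_equals is a hypothetical name). It assumes that the text to discard always sits between the opening parenthesis and the = sign, as in the example above.

```python
import re

def drop_before_equals(text: str) -> str:
    """Hypothetical helper: inside parentheses, keep only what follows '='."""
    # Match "(" plus any text without parentheses or "=", then "=" and spaces,
    # and put back just the opening parenthesis.
    return re.sub(r"\(\s*[^()=]*=\s*", "(", text)

print(drop_before_equals(
    "Feucherolles (Saint-James = Le château royal de Sainte-Gemme)"
))
```

Values without an = sign inside parentheses pass through unchanged.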
Then I can split the strings on commas and parentheses so that, for example:
place_birth
(?Chanvrand)Canton de La Guiche
becomes:
place_new
Chanvrand
Or:
place_birth
Seine-Inférieure (Seine-Maritime)
becomes, in the new variable:
place_new
Seine-Maritime
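The extraction into place_new could look roughly like the following Python sketch. The function name split_place is hypothetical, and the sketch assumes at most one parenthesised group per value and covers only the parenthesis case, not splitting on commas.

```python
import re

def split_place(text: str) -> tuple[str, str]:
    """Hypothetical helper: return (text outside parentheses, text inside)."""
    match = re.search(r"\(([^()]*)\)", text)
    if match is None:
        # No parenthesised part: nothing to move into place_new.
        return text.strip(), ""
    # Strip spaces and question marks around the extracted part.
    inside = match.group(1).strip(" ?")
    # Rejoin what surrounds the parentheses and normalise whitespace.
    outside = text[:match.start()] + " " + text[match.end():]
    outside = re.sub(r"\s+", " ", outside).strip()
    return outside, inside

for raw in ["(?Chanvrand)Canton de La Guiche", "Seine-Inférieure (Seine-Maritime)"]:
    place_birth, place_new = split_place(raw)
    print(place_birth, "|", place_new)
```

The first element of the result would stay in place_birth and the second would go into place_new.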