Hi,
I would like to spot the observations that are very much alike within one string variable.
Let's say for instance that I have a variable with 4 observations, such as:
var1
observation1: "cat"
obs 2: "caty"
obs 3: "the cat is beautiful"
obs 4: "cat"
I would like to have some distance measure that tells me that observation 1 and 4 are equal, observations 1 and 2 are quite similar, but observations 1 and 3 are very different. Is it possible?
Thanks
Related Posts with similar observations within one variable
Is it possible to "export delimited" with variable labels instead of variable names?Dear Statalisters, I'm quite new to Stata and I've run into what seems like a very "basic" problem.…
filling missing data per id from cells below.Dear Stata Community, I am working with long panel data. I have multiple observations per id. In m…
Oaxaca decomposition (nledcompose): how to deal with categorical Y'sDear all, I am new to the Oaxaca-Blinder decomposition, and I'ld like to know your opinion about ho…
Label variable over multiple linesI want to label multiple variables in my dataset in one command, like below: lab var /// id "ID" //…
Event Study - Loop for calculating CARsHello everyone. I am trying to do a loop to calculate CARs (Cumulative Abnormal returns) for an Eve…
Subscribe to:
Post Comments (Atom)
0 Response to similar observations within one variable
Post a Comment