Hi,
I would like to identify observations of a single string variable that are very similar to one another.
For instance, suppose I have a variable with 4 observations, such as:
var1
obs 1: "cat"
obs 2: "caty"
obs 3: "the cat is beautiful"
obs 4: "cat"
I would like a distance measure that tells me that observations 1 and 4 are identical, observations 1 and 2 are quite similar, and observations 1 and 3 are very different. Is this possible?
Thanks
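One measure that behaves this way is the Levenshtein (edit) distance: the minimum number of single-character insertions, deletions, or substitutions needed to turn one string into another. The sketch below is only an illustration in Python, not a statement of how any particular package does it. The function names `levenshtein` and `normalized_distance`, and the choice to divide by the longer string's length, are assumptions made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


def normalized_distance(a: str, b: str) -> float:
    """Edit distance scaled to [0, 1] by the longer string's length (an assumed convention)."""
    longest = max(len(a), len(b))
    return 0.0 if longest == 0 else levenshtein(a, b) / longest


# The four observations from the question, compared against observation 1.
var1 = ["cat", "caty", "the cat is beautiful", "cat"]
reference = var1[0]
distances = [levenshtein(reference, value) for value in var1]
scaled = [normalized_distance(reference, value) for value in var1]

for value, raw, share in zip(var1, distances, scaled):
    print(f"{value!r}: distance={raw}, normalized={share:.2f}")
```

With this measure, observations 1 and 4 are at distance 0, observations 1 and 2 at distance 1, and observations 1 and 3 at distance 17, which matches the ordering described above. The normalized version can be easier to threshold when the strings differ a lot in length.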