Hi, I am new to text similarity in Stata. I am trying to use the lsemantica command to measure the similarity between two text variables, but the command appears to accept only one variable. Could you please help me understand how to calculate the cosine similarity between these two variables? The variable "abstract" contains scientific article abstracts, and the variable "dictionary" contains a list of scientific terms in biomedicine. Using text similarity, I hope to measure how close each abstract is to biomedical research. Thank you for all your help!
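To make the quantity in question concrete, here is a minimal sketch in Python (not Stata) of cosine similarity between each abstract and the dictionary, treating both as simple bags of words. It does not reproduce what lsemantica does internally, and the sample abstracts, the dictionary terms, and the function names are all hypothetical, chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the word-count vectors of two texts."""
    counts_a = Counter(tokenize(text_a))
    counts_b = Counter(tokenize(text_b))
    # Dot product over the words that both texts share.
    dot = sum(counts_a[w] * counts_b[w] for w in counts_a.keys() & counts_b.keys())
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction, so report zero similarity
    return dot / (norm_a * norm_b)


# Hypothetical data standing in for the "dictionary" and "abstract" variables.
dictionary = "gene protein cell tumor antibody"
abstracts = [
    "The protein binds a tumor cell",
    "We estimate a model of vote choice",
]

# One score per abstract: how close it is to the biomedical term list.
scores = [cosine_similarity(a, dictionary) for a in abstracts]
```

In this sketch the dictionary is treated as a single document that every abstract is compared against, so the result is one score per observation. Raw word counts are the simplest weighting; a TF-IDF or latent-semantic weighting would change the scores but not the cosine formula itself.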