Hi, I am new to text similarity analysis in Stata. I am trying to use the lsemantica command to measure the similarity between two text variables. However, the command seems to accept only one variable. Can you please help me understand how to calculate the cosine similarity between these two variables? The variable "abstract" contains scientific article abstracts, and the variable "dictionary" contains a list of scientific terms in biomedicine. Using text similarity, I am hoping to measure how close each abstract is to biomedical research. Thank you for all your help!
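To make clear what is meant by cosine similarity here, below is a minimal sketch in Python rather than Stata. It does not use lsemantica or reproduce its method: it simply treats each text as a bag of word counts and computes the cosine of the angle between the two count vectors. The function names `tokenize` and `cosine_similarity` and the sample strings are hypothetical, chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the word-count vectors of two texts.

    Returns a value between 0.0 (no shared words) and 1.0 (same word
    proportions). Returns 0.0 if either text has no tokens.
    """
    counts_a = Counter(tokenize(text_a))
    counts_b = Counter(tokenize(text_b))

    # Dot product over the words that appear in both texts.
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[w] * counts_b[w] for w in shared)

    # Euclidean norms of the two count vectors.
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))

    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Hypothetical stand-ins for one "abstract" row and the "dictionary" terms.
    abstract = "We study protein folding in the human cell"
    dictionary = "protein gene cell enzyme antibody"
    print(round(cosine_similarity(abstract, dictionary), 4))
```

In this sketch the whole dictionary is treated as one document and compared against each abstract in turn, which is one possible reading of the setup described above; weighting schemes such as TF-IDF or a latent semantic space would give different scores.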