Hi, I am new to Stata text similarity. I am trying to use lsemantica command to identify the similarity between two text variables. However, it seems that the command uses only one variable. Can you please help me understand how I should calculate cosine similarity between these two variables: variable "abstract" contains scientific article abstracts and the variable "dictionary" contains a list of scientific terms in biomedicine. Using text similarity, I am hoping to understand how close each abstract is to biomedical research. Thank you for all your help!
Related Posts with Text Similarity Using lsemantica
Proof of parallel trends in Difference in Difference using a leads and lags regression?Dear Statalist users, I need to prove that the treatment group and the control group follow similar…
Renaming an existing sheet in excel via xl() or putexcelHi im not sure if this is possible or not, but does anyone know if you can rename an existing sheet …
Generate dummy variable attributed to all observations of an ID if one of the observations meets criteriaHi, I haven't had much experience using Stata and was wondering whether one could generate a dummy v…
Robustness CheckHi, For my project, I've just been running simple OLS regressions. I was wondering if there was any…
Using subpop reg to compare regression coefficients between groups.I am investigating how access to water and toilets in the household affects educational attainment i…
Subscribe to:
Post Comments (Atom)
0 Response to Text Similarity Using lsemantica
Post a Comment