Hi,

I am trying to tackle the problem of comparing text between two string variables and identify (and extract) “updated” parts.
String Var1 String Var2 Result new variable
“I wrote this in 2020” “I wrote this in 2020. I updated this in 2021” I updated this in 2021
“someone said this” “In 2020, someone said this” In 2020,
“numbers reported in 2020” “numbers changed in 2021” changed 2021
I found some VBA script for Excel but only works for two cells (not automated to check two columns via loops). I don’t know how to modify VBA scripts. There is a STATA command for sequence analysis (based on Needleman-Wunsch) but I cannot figure out how it applies to comparing sentences. Anyone knows any other program or how the sequence analysis works for comparing sentences?

Thanks!

Xiaodong