Dear Statalist members,
I want to identify the political connections of the firms. For this I have companies officials names and on the other side I have names of the politicians. I want to trace the politicians who are also the company's directors/officials etc. I want to compare two columns having string data, and the names in both columns may have little difference as well. a politician name may appear little different "Muhammad Saleem Khan" in political data column but it may be "Saleem Khan" in firms data column. I have 6000 rows in corporate data column and 440000 rows in political data column. to cut it short every name of the first variable need to be searched in the complete data of the 2nd variable.