I would like to remove stopwords from strings, and I was advised to use txttool. However, Stata reports "unmatched quote" when I run the command. The strings contain the text of annual reports, which were stored as string variables in Stata using Wordstat. I use Stata version 16. The text has been converted to lower case and is held in the variable document_lc.

I counted the total number of words with wordcount, and now I want to create a variable that holds the number of words excluding stopwords.

This is the command I used:

txttool document_lc, generate(text_wo_stopwords_german) noclean nooutput stopwords("/Volumes/Elements//Stopwords/German stopwords.txt)

Is it possible that the strings are too long? What might be a solution?

Thank you

Robert