I would like to remove stopwords from strings, and I was advised to use txttool. However, Stata reports an "unmatched quote" error when I run the command. The strings contain the text of annual report files, which were stored as strings in Stata using Wordstat. I use Stata version 16. The text has been converted to lower case in the variable document_lc.
I counted the total number of words with wordcount, and now I want to create a variable that gives the number of words excluding stopwords.
This is the command I used:
txttool document_lc, generate(text_wo_stopwords_german) noclean nooutput stopwords("/Volumes/Elements//Stopwords/German stopwords.txt)
Is it possible that the strings are too long? What might be a solution?
Thank you
Robert