Hello,
I'm trying to scrape a webpage, and have imported html code as one variable and am trying to extract my data using string functions.
To parse with different substrings that vary only in the middle (hence the asterisk below) I want to do something like this:
split htmlcode, p(`"<div class="ysf-"'*"<a"*">" [2nd string following data])
However, I don't think including the asterisk is acceptable use for the split command.
Can anyone recommend another way to do this?
Thank you!
-Reese
Related Posts with Web scraping / string parsing help
File pathHello ! I am facing a problem after giving command to include file : . do "C:\Users\GP\AppData\Loc…
Creating a year variable in a longitudinal datasetHi All My dataset is longitudinal in the long format, and each individual has 5 rows of data with e…
Interpretation of the covariance between random intercepts and slopes after mixed (multilevel analysis)Dear all, I am trying to estimate a model with a three-level hierarchical structure using STATA 16’…
graphical markersDear All, I have this graph Code: * Example generated by -dataex-. To install: ssc install dataex c…
Extract coefficients and SE from stored estimatesI am looking for a way of extracting coefficients and standard errors(SE) after storing them. For ex…
Subscribe to:
Post Comments (Atom)
0 Response to Web scraping / string parsing help
Post a Comment