Hello,
I'm trying to scrape a webpage, and have imported html code as one variable and am trying to extract my data using string functions.
To parse with different substrings that vary only in the middle (hence the asterisk below) I want to do something like this:
split htmlcode, p(`"<div class="ysf-"'*"<a"*">" [2nd string following data])
However, I don't think including the asterisk is acceptable use for the split command.
Can anyone recommend another way to do this?
Thank you!
-Reese
Related Posts with Web scraping / string parsing help
removing intercepts from groups of dummy variables in panel datahi there, so i have a panel regression of 174 nations and their GDP growth for every year 1994-2017,…
Data mergeHi Statalist, I am new to Stata and need help to merge two data sets. The merge or by variables are…
Not a numberHi, While trying to import a CSV dataset with Covid data for France, I realized there are numeric v…
Measuring the average Wage before and After movingHello Members, I have data on individuals moving across regions and I have grouped each region into…
Reshape the data-set into a specified formI have a data-set contains the following variables. And, I want to reshape the data-set into a speci…
Subscribe to:
Post Comments (Atom)
0 Response to Web scraping / string parsing help
Post a Comment