Tuesday, November 27, 2018

Web scraping / string parsing help

Hello,

I'm trying to scrape a webpage, and have imported html code as one variable and am trying to extract my data using string functions.

To parse with different substrings that vary only in the middle (hence the asterisk below) I want to do something like this:

split htmlcode, p(`"<div class="ysf-"'*"<a"*">" [2nd string following data])

However, I don't think including the asterisk is acceptable use for the split command.

Can anyone recommend another way to do this?

Thank you!

-Reese






No comments:

Post a Comment