I have scraped a website in order to find trending TV shows.
my html output looks something like this (obviously much longer but for brevitys sake):
<span class="btn-utility-container"><a class="btn-utility btn-watchlist "
data-button="watchlist" data-id="299830" data-name="American Pickers"
I want to find and then extract the data that comes after data-name=" and then ends with the next "
so in this case, the output would be: American Pickers (no quotations)
For reference, here is my code that does not work
wikis = ["http://www.tvguide.com/trending-tonight/"]
for wiki in wikis:
website = requests.get(wiki)
caps = re.findall(b'data-name=">(.|\n)*?<\/">', website.content) #Relevant line
soup? It isn't used