Here is the HTML that appears on my site:
<meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />
How can I use lxml.html to grab that?
Here is the HTML that appears on my site:
<meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />
How can I use lxml.html to grab that?
Use xpath to find the meta tag by name attribute and get the value of content attribute:
from lxml.html import fromstring
html_data = """ <meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />"""
tree = fromstring(html_data)
print tree.xpath('//meta[@name="token"]/@content')
prints:
['I_WANT_THIS']
meta tag anywhere in the html, this meta tag show have an attribute name with the value token, then give me the value of content attrubite.