Parse HTML value in Python

Question

I have the following HTML:

<td>
   <input maxlen="1" name="db" size="1" type="text" value="25"/>
   <div style="display:inline-block;position:relative;top:6px;left:0px;width:20px;">
    <input class="p_b" name="ta" style="height:1em; width:1.5em;line-height:1em;padding:0px;margin:0px;border:0px;background-color:#f3f3f3" type="submit" value="▴"/>
    <input class="p_b" name="ta" style="height:1em; width:1.5em;line-height:1em;padding:0px;margin:0px;border:0px;background-color:#f3f3f3" type="submit" value="▾"/>
   </div>
   <span style="position:relative;top:8px">
    
   </span>
   <input maxlen="1" name="dc" size="1" type="text" value="0"/>
   <div style="display:inline-block;position:relative;top:6px;left:0px;width:20px;">
    <input class="p_b" name="tb" style="height:1em; width:1.5em;line-height:1em;padding:0px;margin:0px;border:0px;background-color:#f3f3f3" type="submit" value="▴"/>
    <input class="p_b" name="tb" style="height:1em; width:1.5em;line-height:1em;padding:0px;margin:0px;border:0px;background-color:#f3f3f3" type="submit" value="▾"/>
   </div>
  </td>

I need to extract both numbers, from value="25" and value="0". My current workaround is:

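import re

# soup is the BeautifulSoup object built from the HTML above
y = soup.findAll('input', {'type':'text'})
# raw string so that \d is not treated as a string escape
a = re.findall(r'(?<=value=")(\d*)', str(y))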

But I think there should be a more direct way to do it with the parser. Can anyone help?

Does this answer your question? Python beautifulsoup - getting input value
– Ckrielle, Commented Dec 15, 2020 at 11:57
@Parolla I know, and you don't have to stick to it either. XPath was designed for those kinds of queries.
– MetallimaX, Commented Dec 15, 2020 at 12:01

Parolla · Accepted Answer · 2020-12-15 11:42:17Z


Try the line below to extract the value attribute from each text input node:

values = [element['value'] for element in soup.findAll('input', {'type':'text'})]
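
For context, here is a minimal self-contained sketch of the same idea. It assumes the bs4 package is installed and uses a trimmed copy of the markup from the question (the style attributes are dropped for brevity). The variable names text_inputs and numbers are illustrative only and do not come from the original post.

```python
from bs4 import BeautifulSoup

# Trimmed copy of the question's markup (style attributes removed).
html = """
<td>
  <input maxlen="1" name="db" size="1" type="text" value="25"/>
  <input class="p_b" name="ta" type="submit" value="▴"/>
  <input maxlen="1" name="dc" size="1" type="text" value="0"/>
  <input class="p_b" name="tb" type="submit" value="▾"/>
</td>
"""

# "html.parser" is the stdlib backend, so lxml is not needed here.
soup = BeautifulSoup(html, "html.parser")

# Keep only the text inputs; the submit buttons also carry a value attribute.
text_inputs = soup.find_all("input", {"type": "text"})

# .get() returns None instead of raising KeyError when value is missing.
values = [tag.get("value") for tag in text_inputs]

# Attribute values are strings; convert the numeric ones if integers are needed.
numbers = [int(v) for v in values if v is not None and v.isdigit()]

print(values)   # ['25', '0']
print(numbers)  # [25, 0]
```

Filtering on type="text" is what keeps the arrow buttons out of the result, so no regular expression is needed at all.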

P.S. Note that using regex for web scraping is bad practice, since a pattern breaks easily when attribute order, quoting, or whitespace changes. There are enough web-scraping tools that can easily do this for you (for instance, BeautifulSoup and lxml can be used in Python).
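
If installing a third-party library is not an option, the same parser-based approach can be sketched with the standard library's html.parser module. This is a swapped-in alternative, not something the answer itself proposes, and the class name TextInputValues is hypothetical.

```python
from html.parser import HTMLParser


class TextInputValues(HTMLParser):
    """Collect the value attribute of every <input type="text"> tag."""

    def __init__(self):
        super().__init__()
        self.values = []

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs.
        attributes = dict(attrs)
        if tag == "input" and attributes.get("type") == "text":
            self.values.append(attributes.get("value"))


# Trimmed markup modelled on the question; self-closing tags such as
# <input .../> are routed to handle_starttag by the default implementation.
html = (
    '<td>'
    '<input name="db" type="text" value="25"/>'
    '<input name="ta" type="submit" value="▴"/>'
    '<input name="dc" type="text" value="0"/>'
    '</td>'
)

parser = TextInputValues()
parser.feed(html)
print(parser.values)  # ['25', '0']
```

This is more verbose than the BeautifulSoup one-liner, but it still reads attributes from parsed tags rather than from the raw text.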

