Reading a specific class in a webpage using python

Question

I have a script which reads data from a webpage using HTMLParser:

import urllib
from HTMLParser import HTMLParser
import re


class get_HTML_Info(HTMLParser):
    def handle_data(self, data):
        print data


adib = urllib.urlopen('http://www.bulldoghax.com/secret/spinner')
htmlsource = adib.read()
adib.close()

parser = get_HTML_Info()
parser.feed(str(htmlsource))

I end up with two set of data like this:

bulldoghax

8530330882

In the terminal, I just want to extract only that number and set it to a string in python.

Mike · Accepted Answer · 2016-03-11 15:26:13Z

2

Use Beautiful Soup for scraping data.

pip install BeautifulSoup

import urllib
from HTMLParser import HTMLParser
import re

adib = urllib.urlopen('http://www.bulldoghax.com/secret/spinner')

htmlsource = adib.read()

from bs4 import BeautifulSoup
soup = BeautifulSoup(htmlsource)
for each_div in soup.findAll('div',{'class':'number'}):
    print each_div.text

edited Mar 11, 2016 at 15:26

Mike

20.4k13 gold badges64 silver badges96 bronze badges

answered Mar 11, 2016 at 12:53

Himanshu dua

2,5141 gold badge22 silver badges28 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

shoomy Over a year ago

Thank you!, that's perfect!, I just had to change soup = BeautifulSoup(htmlsource) to soup = BeautifulSoup(htmlsource, "lxml") because it gave me an error the first time i tried it

shoomy Over a year ago

@himanshu_dua can you help me with writing a code which sends that number a cookie value for this website http://www.bulldoghax.com/secret/codes

Maltysen · Accepted Answer · 2016-03-11 12:45:06Z

1

Simple, here:

n="".join(filter(str.isdigit, data))

It filters the string based on being a number or not, then joins it into a string.

answered Mar 11, 2016 at 12:45

Maltysen

1,94617 silver badges18 bronze badges

1 Comment

shoomy Over a year ago

Thank you, now it's only showing the numbers, is there anyway I can remove the '\n' new line things, I just want the output to be that number

Collectives™ on Stack Overflow

Reading a specific class in a webpage using python

2 Answers 2

2 Comments

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related