How to get src attribute from <image/> with Python

Question

I am scraping data from one site, and I need to find one img. I get it but the output is not what I need.

I have tried looking online for solutions, changing code but nothing worked.

r = requests.get(baseurl)
content = r.content
soup = BeautifulSoup(content, "html.parser")

images = soup.findAll('img')[1]
print(images)

Output I get:

<img src="https://cdn.rubyrealms.com/images/WKpivrdGBJJ9p6etIY2aJpixikFj4vnpmpPR9pXjK4Y8K.png" style="border-radius: 5px"/>

Output I need:

cdn.rubyrealms.com/images/WKpivrdGBJJ9p6etIY2aJpixikFj4vnpmpPR9pXjK4Y8K.png

(I tried print(images.text))

Parse the src attribute from your <img> element

Cody Caughlan
– Cody Caughlan

2019-07-08 22:24:50 +00:00
Commented Jul 8, 2019 at 22:24 — Cody Caughlan
– Cody Caughlan, Commented Jul 8, 2019 at 22:24
Try images.get('src')

drec4s
– drec4s

2019-07-08 22:28:36 +00:00
Commented Jul 8, 2019 at 22:28 — drec4s
– drec4s, Commented Jul 8, 2019 at 22:28

0xPrateek · Accepted Answer · 2019-07-08 22:49:31Z

4

you can get the img tag's src content using ;

images = soup.findAll('img')[1]
print(images.get("src"))

or

images = soup.findAll('img')[1]
print(images['src'])

Output

https://cdn.rubyrealms.com/images/WKpivrdGBJJ9p6etIY2aJpixikFj4vnpmpPR9pXjK4Y8K.png

The problem with print(images.text) is that it is used to extract the text in between two tags and you want to extract the text which is inside the tag itself.

Hope this helps you :)

edited Jul 8, 2019 at 22:49

answered Jul 8, 2019 at 22:37

0xPrateek

1,18811 silver badges29 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

João Teixeira · Accepted Answer · 2019-07-08 22:37:25Z

1

Here's a sample you can adapt:

parser.feed('<img src="python-logo.png" alt="The Python logo">')
Start tag: img
attr: ('src', 'python-logo.png')

REFERENCE: https://docs.python.org/3/library/html.parser.html

answered Jul 8, 2019 at 22:37

João Teixeira

691 silver badge12 bronze badges

Collectives™ on Stack Overflow

How to get src attribute from <image/> with Python

2 Answers 2

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related