Python Web scraping: Finding specific link

Question

I'm trying to isolate a specific link for images from a web page but can't quite get there. The HTML looks something like:

<head>
   <img alt="Generic title" src="https://genericURL/photo/picture.jpg/"> 
   <img src="https://genericurl/.../">
   <img src="https://genericurl/.../">
   ....

I am able to return many links but the link I specifically want is the top one shown, it is the only link containing /photo/picture.jpg. I have tried using the answer from Find specific link text with bs4 and other variations but haven't figured it out yet. Is anyone able to take a look please?

My code:

links = soup.findAll('img', {'src': re.compile('^http://image\d+')})
for link in links:
     print(link.text)

EDIT: Using the suggestions I realised that the link format was changing based on the filter I was using, e.g.: when I was printing the entire web page I saw the link as http://image.... However when I was using findAll('img', {'src' ... the link was outputting as https://img so I was trying to re.compile the wrong things.

Why not re.compile("photo/picture.jpg")?

akuiper
– akuiper

2017-03-11 01:00:51 +00:00
Commented Mar 11, 2017 at 1:00 — akuiper
– akuiper, Commented Mar 11, 2017 at 1:00

宏杰李 · Accepted Answer · 2017-03-11 05:06:43Z

3

soup.find_all("img", alt="Generic title")

you should use alt as filter.

answered Mar 11, 2017 at 5:06

宏杰李

12.2k2 gold badges32 silver badges37 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

B.Adler · Accepted Answer · 2017-03-11 03:00:41Z

0

import re
links = soup.findAll('img', {'src': re.compile('^http://image\d+')})
for link in links:
    if re.search('photo\/pictures\.jpg', link.get('href', ''), re.IGNORECASE):
        link_i_want = link.get('href')
        break

answered Mar 11, 2017 at 3:00

B.Adler

1,5481 gold badge18 silver badges26 bronze badges

Collectives™ on Stack Overflow

Python Web scraping: Finding specific link

2 Answers 2

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related