Scrapy - CSS selectors

Question

I'm trying to understand how CSS selectors work using Scrapy. but I definitely don't understand to navigate in several html tag. For example, I'm trying to extract all the href link in the div id "portefeuille_bloc":

code screenshot

I tried this code but I can't identify where the mistake is:

response.css('div[id=portefeuille_bloc a::attr(href)').extract()

Furthermore, I tried to go deeper in the structure, and get all the h3 tag in the sub-division "portefeuille_bloc_bloc:

code screenshot

I think your main mistake was missing the ending square bracket after portefeuille_bloc. — Anthony Mills
– Anthony Mills, Commented Dec 3, 2019 at 14:44

Anthony Mills · Accepted Answer · 2019-12-03 14:42:57Z

1

Try this:

response.css('div#portefeuille_bloc a::attr(href)').getall()

See this doc page for more ideas:

https://docs.scrapy.org/en/latest/topics/selectors.html

answered Dec 3, 2019 at 14:42

Anthony Mills

8,7764 gold badges36 silver badges52 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Ikram Khan Niazi · Accepted Answer · 2019-12-04 07:22:54Z

0

Try this:

response.css('#portefeuille_bloc ::attr(href)').extract()

There is no need to use HTML tags with ids and classes.

answered Dec 4, 2019 at 7:22

Ikram Khan Niazi

8017 silver badges17 bronze badges

Collectives™ on Stack Overflow

Scrapy - CSS selectors

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related