Extracting links with specific class using css selectors

Question

I have the following HTML structure
I want to extract all the links with the class:dev-link

<a class="dev-link" href="mailto:[email protected]" rel="nofollow" title='Photoshoot"</a>

I am using the below code to extract the link in scrapy

response.css('.dev-link::attr(href)').extract()

I am getting the correct output but is this the right way to use css selectors??

if you are using python, why not using regex?

Dhaval Jardosh
– Dhaval Jardosh

2018-01-25 17:47:07 +00:00
Commented Jan 25, 2018 at 17:47 — Dhaval Jardosh
– Dhaval Jardosh, Commented Jan 25, 2018 at 17:47

Yuseferi · Accepted Answer · 2018-01-25 17:56:37Z

1

As you can see in Scrapy Documentation there are two methods to scrap data, CSS Selector and XPath Selector both are works correctly but XPath needs some practice to get expert, in my opinion, Xpath is more power in special cases you can scrap data easier that CSS selector ( but of course you can get them with CSS selector too),

what you did is correct

 link = response.css('.dev-link::attr(href)').extract_first()

and also you can get it with the following too

link = response.xpath('/[contains(@class,’dev-link’)]/@href').extract_first()

answered Jan 25, 2018 at 17:56

Yuseferi

8,83011 gold badges77 silver badges106 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Extracting links with specific class using css selectors

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related