Python beautifulsoup match regex after string

Question

I am using BeautifulSoup and Python to scrape a webpage. I have a BS element,

a = soup.find('div', class_='section lot-details')

which returns a series of list objects as per below.

<li><strong>Location:</strong> WA - 222 Welshpool Road, Welshpool</li>
<li><strong>Deliver to:</strong> Pickup Only WA</li>

I want to return the text after each str

WA - 222 Welshpool Road, Welshpool
Pickup Only WA

How do I get this out of the BS object? I'm unsure of the regex, and also how this interacts with BeautifulSoup.

How does getting div return li?

AKS
– AKS

2016-05-19 13:43:39 +00:00
Commented May 19, 2016 at 13:43 — AKS
– AKS, Commented May 19, 2016 at 13:43

Rahul K P · Accepted Answer · 2016-05-19 14:53:41Z

1

(?:</strong>)(.*)(?:</li>) capture field \1 (.*) would do the work.

Python code sample:

In [1]: import re
In [2]: test = re.compile(r'(?:</strong>)(.*)(?:</li>)')
In [3]: test.findall(input_string)
Out[1]: [' WA - 222 Welshpool Road, Welshpool', ' Pickup Only WA']

check it here https://regex101.com/r/fD0fZ9/1

edited May 19, 2016 at 14:53

Rahul K P

16.2k4 gold badges40 silver badges56 bronze badges

answered May 19, 2016 at 13:29

Hermes Martinez

905 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

myeewyee Over a year ago

This works & also gives me a method for other more general cases as well.

AKS · Accepted Answer · 2016-05-19 13:47:57Z

1

You don't really need regex. If you have your li tags in a list:

>>> for li in li_elems:
...     print li.find('strong').next_sibling.strip()

WA - 222 Welshpool Road, Welshpool
Pickup Only WA

This is assuming that there is only one strong element in the li and text is afterwards.

Or, alternatively:

>>> for li in li_elems:
...     print li.contents[1].strip()

WA - 222 Welshpool Road, Welshpool
Pickup Only WA

answered May 19, 2016 at 13:47

AKS

20k3 gold badges47 silver badges55 bronze badges

Collectives™ on Stack Overflow

Python beautifulsoup match regex after string

2 Answers 2

1 Comment

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Related