Using regex to extract substrings

Question

I have a string:

s = r'"url" : "a", "meta": "b", "url" : "c"'

What I want is to capture the substring url: ... up to the ,, so the expected output is a list:

[r'"url" : "a"', r'"url" : "b"']

I am using:

re.findall(r'("url"):(.*),', s)

but all it does is to return the entire string. Is there something i am doing wrong?

Ammar Aslam · Accepted Answer · 2021-11-19 14:48:44Z

3

Your last "," was beeing matched due to a greedy search, (.*?) is non greedy. Also the last comma is optional so that needs to be ignored if not present

import re

s = r'"url":"a","meta":"b","url":"c"'

print(re.findall(r'("url"):"(.*?)",?', s))

edited Nov 19, 2021 at 14:48

answered Nov 19, 2021 at 9:17

Ammar Aslam

6704 silver badges17 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Sergio Lema · Accepted Answer · 2021-11-19 09:17:58Z

1

You must escape the , to avoid including the comma inside the group. Try this:

re.findall(r'(("url" :[^,]*),*)', s)

answered Nov 19, 2021 at 9:17

Sergio Lema

1,6492 gold badges16 silver badges27 bronze badges

Collectives™ on Stack Overflow

Using regex to extract substrings

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related