python regex with repeating subpattern

Question

I am wondering if there is a 'smart' way (one regex expression) to extract IDs from the following paragraph:

... imgList = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif'; ...

The result should be a list containing 1260089 and 1260090. There may be up to 10 IDs.

I need something like:

re.findall('imgList = (some expression)', string)

Any ideas?

Noctua · Accepted Answer · 2013-10-17 21:36:40Z

1

The best approach is a single regex that finds all the numbers, applied with re.findall:

>>> imgList = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif'
>>> import re
>>> re.findall(r'optimized/([0-9]+)_fpx', imgList)
['1260089', '1260090']

You could of course make the regex stricter, but if the data is as you indicated, this should suffice.
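
One possible way to tighten it, as a sketch only: first isolate the quoted value assigned to imgList, then extract the IDs from that value alone. The helper name `extract_ids` and the surrounding text in `page` are assumptions made for illustration; only the imgList assignment itself comes from the question.

```python
import re

def extract_ids(text):
    """Return the IDs listed in the imgList assignment, or [] if it is absent."""
    # Step 1: capture only the quoted value that follows "imgList =".
    assignment = re.search(r"imgList\s*=\s*'([^']*)'", text)
    if assignment is None:
        return []
    # Step 2: within that value, each ID is the run of digits before "_fpx.tif".
    return re.findall(r"/(\d+)_fpx\.tif", assignment.group(1))

# Hypothetical surrounding text; the "other" variable is invented for the demo.
page = ("var other = '1/optimized/9999999_fpx.tif'; "
        "imgList = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif'; done")
print(extract_ids(page))
```

Splitting the work into two steps keeps each pattern simple and avoids picking up similar-looking paths elsewhere in the text.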

edited Oct 17, 2013 at 21:36

answered Oct 17, 2013 at 11:50


Ben · Answer · 2013-10-17 11:52:25Z

0

import re

s = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif'

print(re.findall(r'(\d+)_fpx\.tif', s))
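
A small sketch of why the dot before "tif" is worth escaping: an unescaped `.` matches any character, not just a literal period. The malformed second entry in `s` below is a made-up value for the demonstration.

```python
import re

# Hypothetical input: the second entry ends in "_fpxXtif" instead of "_fpx.tif".
s = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpxXtif'

# Unescaped dot: "." matches any character, so both entries match.
unescaped = re.findall(r'(\d+)_fpx.tif', s)

# Escaped dot: only the literal ".tif" suffix matches.
escaped = re.findall(r'(\d+)_fpx\.tif', s)

print(unescaped)
print(escaped)
```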

answered Oct 17, 2013 at 11:52


Jakob · Answer · 2013-10-17 12:01:55Z

0

If the optimized/ and _fpx parts are not guaranteed and the ID has between 7 and 10 digits, you could do something like:

import re
re.findall(r'\d{7,10}', imgList)

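This finds every run of 7 to 10 digits in the string, so numbers with fewer than 7 digits are skipped. Note that a run longer than 10 digits is not excluded outright: its first 10 digits still match unless the pattern is bounded, for example with (?<!\d) and (?!\d).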
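
To illustrate how digit boundaries affect the result, here is a minimal sketch. The 12-digit number appended to `sample` is a made-up value, not part of the question's data.

```python
import re

# Hypothetical sample: the two IDs from the question plus a 12-digit number.
sample = "9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif,123456789012"

# Without boundaries, the 12-digit run is matched partially (its first 10 digits).
loose = re.findall(r"\d{7,10}", sample)

# With lookarounds, a match may not be preceded or followed by another digit.
strict = re.findall(r"(?<!\d)\d{7,10}(?!\d)", sample)

print(loose)
print(strict)
```

The lookarounds do not consume characters, so they only constrain where a match may start and end.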

answered Oct 17, 2013 at 12:01


1 Comment

user2890231 Over a year ago

This approach is not what I was looking for, but might get the job done. Thanks!

YaleCheung · Answer · 2013-10-17 13:22:06Z

0

import re
imgList = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif'
re.findall(r'([0-9]{7})', imgList)

['1260089', '1260090']

This code only fits your exact case, where every ID has exactly 7 digits.
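
A note on where the quantifier goes, shown as a short sketch: when the {7} sits outside the capturing group, the group is overwritten on each repetition, so re.findall returns only the last digit of each match rather than the whole ID. The variable names `last_digits` and `whole_ids` are illustrative.

```python
import re

imgList = '9/optimized/1260089_fpx.tif,0/optimized/1260090_fpx.tif'

# Quantifier outside the group: the group is overwritten on each repetition,
# so findall reports only the last digit captured in each match.
last_digits = re.findall(r'([0-9]){7}', imgList)

# Quantifier inside the group: the whole 7-digit run is captured and returned.
whole_ids = re.findall(r'([0-9]{7})', imgList)

print(last_digits)
print(whole_ids)
```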

answered Oct 17, 2013 at 13:22
