Split number from string in Python [duplicate]

Question

I would like to split the text from the number in a sentence such as: "It was amazing in 2016"

I use:

re.split('\s*((?=\d+))
out: 'It was amazing in', '2016'

Now I would like to do the opposite: if a sentence starts with a number and is followed by text, like: '2016 was amazing'

I would like the result to be: '2016', 'was amazing'

You'll benefit from this tutorial on regular expressions. Show us what you've tried and where you're stuck.
– Arya McCarthy, Commented Apr 10, 2017 at 18:10

anubhava · Accepted Answer · 2017-04-10 18:11:42Z

5

Using lookarounds you can use a single regex for both cases:

\s+(?=\d)|(?<=\d)\s+

Code:

>>> import re
>>> s = "It was amazing in 2016"
>>> re.split(r'\s+(?=\d)|(?<=\d)\s+', s)
['It was amazing in', '2016']

>>> s = "2016 was amazing"
>>> re.split(r'\s+(?=\d)|(?<=\d)\s+', s)
['2016', 'was amazing']

RegEx Breakup:

\s+ - Match 1 or more whitespaces
(?=\d) - Lookahead that asserts next character is a digit
| - OR
(?<=\d) - Lookbehind that asserts previous character is a digit
\s+ - Match 1 or more whitespaces
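
As a rough sketch of how the pattern above could be wrapped for reuse (the names NUMBER_BOUNDARY and split_at_number are made up for illustration and are not part of the answer), note that the split happens at any whitespace touching a digit, so a number in the middle of a sentence yields three parts:

```python
import re

# Whitespace that is followed by a digit, or preceded by one.
NUMBER_BOUNDARY = re.compile(r'\s+(?=\d)|(?<=\d)\s+')


def split_at_number(sentence):
    """Split a sentence at every run of whitespace adjacent to a digit."""
    return NUMBER_BOUNDARY.split(sentence)


print(split_at_number("It was amazing in 2016"))  # ['It was amazing in', '2016']
print(split_at_number("2016 was amazing"))        # ['2016', 'was amazing']
# Both alternatives match around a number in the middle of the sentence.
print(split_at_number("It was 2016 and fun"))     # ['It was', '2016', 'and fun']
```

If only a leading or trailing number should be separated, the result would need an extra check on its length.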


2 Comments

sachinruk Over a year ago

This regex doesn't work for something like str = 'Surface Pro5' where I hoped it would split at the 5. Would be extremely grateful if you added this scenario too.

anubhava Over a year ago

@sachinruk: You may use: list(filter(None, re.split(r'(\D+)(?=\d)|(?<=\d)(\D+)', str))) OR re.findall(r'\d+|\D+', str)
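
A minimal sketch of the findall variant from this comment, applied to the 'Surface Pro5' case where no whitespace separates the text from the number. The helper name digit_chunks is hypothetical, and the stripping of padding is an added assumption about the desired output:

```python
import re


def digit_chunks(text):
    """Return alternating runs of digits and non-digits, without padding."""
    # \d+|\D+ matches each maximal run of digits or of non-digits in turn.
    chunks = (chunk.strip() for chunk in re.findall(r'\d+|\D+', text))
    # Drop chunks that were whitespace only.
    return [chunk for chunk in chunks if chunk]


print(digit_chunks('Surface Pro5'))      # ['Surface Pro', '5']
print(digit_chunks('2016 was amazing'))  # ['2016', 'was amazing']
```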

Dalvenjia · Answer · 2017-04-10 18:19:47Z

0

In my opinion RegEx is overkill for this task, so unless you are already using RegEx in your program or it's required (assignment or otherwise), I recommend some string manipulation functions to get what you want.

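def ends_in_digit(my_string):
    # Split off the last whitespace-separated word; return both parts if it is a number.
    separated = my_string.rsplit(maxsplit=1)
    return separated if separated and separated[-1].isdigit() else False

def starts_with_digit(my_string):
    # Split off the first whitespace-separated word; return both parts if it is a number.
    separated = my_string.split(maxsplit=1)
    return separated if separated and separated[0].isdigit() else False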
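
One way the two checks could be combined into a single call is sketched below. The function name split_number_plain and the choice to return None when neither end is a number are assumptions made for this example, not part of the answer:

```python
def split_number_plain(sentence):
    """Return [number, rest] or [rest, number] using only str methods."""
    # Try a leading number first.
    parts = sentence.split(maxsplit=1)
    if len(parts) == 2 and parts[0].isdigit():
        return parts
    # Otherwise try a trailing number.
    parts = sentence.rsplit(maxsplit=1)
    if len(parts) == 2 and parts[-1].isdigit():
        return parts
    # Neither end is a number (or the sentence has fewer than two words).
    return None


print(split_number_plain("2016 was amazing"))        # ['2016', 'was amazing']
print(split_number_plain("It was amazing in 2016"))  # ['It was amazing in', '2016']
print(split_number_plain("no digits here"))          # None
```

Unlike the regex approaches, this only recognizes a number that is a separate word at the very start or end of the sentence.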


Wiktor Stribiżew · Answer · 2017-04-10 18:54:13Z

0

Another way to easily split into digits and non-digits is to match with the \d+|\D+ regex. It will yield chunks with leading/trailing whitespace, but that can easily be removed (or kept if it is not important):

import re
r = re.compile(r'\d+|\D+')
ss = ['It was amazing in 2016', '2016 was amazing']
for s in ss:
    print(r.findall(s))  # chunks keep leading/trailing whitespace
    print([x.strip() for x in r.findall(s)])  # no leading/trailing whitespace

See the Python demo.
