Detect numbers in string

Question

value = 'ad.41.bd'

if len(value) == len(value.strip({0,1,2,3,4,5,6,7,8,9})):
    # no numbers
else:
    # numbers present

Is there a cleaner way of detecting numbers in a string in Python?

Marcin · Accepted Answer · 2011-07-11 11:05:42Z

19

What about this?

import re
if not re.search(r'\d+', value):
    # no numbers
else:
    # numbers present
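
A minimal sketch of the same check wrapped in a function, assuming Python 3. The name `contains_digit` is hypothetical and not part of the answer. It uses a single `\d` because one digit is enough to decide, and a raw string so the backslash reaches the regex engine unchanged.

```python
import re

# Hypothetical helper name, chosen for illustration only.
def contains_digit(value):
    """Return True if value contains at least one digit character."""
    # re.search scans the string and stops at the first match.
    return re.search(r'\d', value) is not None

print(contains_digit('ad.41.bd'))
print(contains_digit('ad.bd'))
```

Note that for `str` patterns in Python 3, `\d` matches any Unicode decimal digit, not only 0-9.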

answered Jul 11, 2011 at 11:05

Marcin

Rusty Rob · Accepted Answer · 2011-07-11 20:24:12Z

9

>>> value="ab3asdf"
>>> any(c.isdigit() for c in value)
True
>>> value="asf"
>>> any(c.isdigit() for c in value)
False




>>> value = 'ad.41.bd'
>>> any(map(lambda c:c.isdigit(),value))
True

EDIT:

>>> value="1"+"a"*10**6
>>> any(map(lambda c:c.isdigit(),value))
True
>>> from itertools import imap
>>> any(imap(lambda c:c.isdigit(),value))
True

map took about 1 second (on an old Python), while imap was instant because imap returns a lazy iterator. (In Python 3, map is itself lazy and itertools.imap no longer exists.) Note that in the real world the number is often more likely to be at the end of the file name.
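
The difference between eager and lazy evaluation can be made visible by counting how many characters the predicate actually inspects. This is a sketch assuming Python 3, where `map` is lazy; the helper `count_inspected` is hypothetical, and the eager case is simulated by forcing the result into a list, which is roughly what Python 2's `map` did.

```python
value = "1" + "a" * 10**6

# Hypothetical helper: runs any() over make_iterable(predicate, value)
# and reports how many characters the predicate was called on.
def count_inspected(make_iterable):
    seen = []

    def is_digit(c):
        seen.append(c)
        return c.isdigit()

    found = any(make_iterable(is_digit, value))
    return found, len(seen)

# Lazy: any() stops pulling from map() after the first True.
lazy_result = count_inspected(map)

# Eager: the whole list is built before any() sees a single item.
eager_result = count_inspected(lambda f, it: list(map(f, it)))

print(lazy_result)
print(eager_result)
```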

edited Jul 11, 2011 at 20:24

answered Jul 11, 2011 at 11:10

Rusty Rob

8 Comments

Felix Kling Over a year ago

It would be interesting to see how good (or bad) this performs compared to a regex.

Kirk Strauser Over a year ago

I like any, but map generates the entire list first. If value is huge and the first character is a digit, this code still processes the whole thing.

Rusty Rob Over a year ago

I believe any stops when it finds the first True. However map doesn't return a generator (apart from I think in python 3) so a whole list is created in memory (even if the first character is a digit). Otherwise I'm guessing it would be fairly similar.

mhyfritz Over a year ago

@robert king, w/o the parens you create a list of methods. This will always evaluate to True! Try it with value = 'abc'.

Steven Rumbalski Over a year ago

any(c.isdigit() for c in value) uses a generator expression and dispenses with the lambda and the map.

Kirk Strauser · Accepted Answer · 2011-07-11 18:05:44Z

4

from string import digits
def containsnumbers(value):
    return any(char in digits for char in value)

EDIT:

And just for thoroughness:

any(c.isdigit()):

>>> timeit.timeit('any(c.isdigit() for c in value)', setup='value = "abcd1"')
1.4080650806427002

any(c in digits):

>>> timeit.timeit('any(c in digits for c in value)', setup='from string import digits; value = "abcd1"')
1.392179012298584

re.search (1 or more digits):

>>> timeit.timeit("re.search('\d+', value)", setup='import re; value = "abcd1"')
1.8129329681396484

re.search (stop after one digit):

>>> timeit.timeit("re.search('\d', value)", setup='import re; value = "abcd1"')
1.599431037902832

re.match (non-greedy):

>>> timeit.timeit("re.match(r'^.*?\d', value)", setup='import re; value = "abcd1"')
1.6654980182647705

re.match(greedy):

>>> timeit.timeit("re.match(r'^.*\d', value)", setup='import re; value = "abcd1"')
1.5637178421020508

any(map()):

>>> timeit.timeit("any(map(lambda c:c.isdigit(),value))", setup='value = "abcd1"')
1.9165890216827393

any(imap()):

>>> timeit.timeit("any(imap(lambda c:c.isdigit(),value))", setup='from itertools import imap;value = "abcd1"')
1.370448112487793

Generally, the less complex regexps ran more quickly. c.isdigit() and c in digits are almost equivalent. The greedy re.match was slightly faster than re.search, while the non-greedy re.match was slightly slower. map() was the slowest solution and imap() the fastest, although imap() was within rounding error of any(c.isdigit()) and any(c in digits).
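
To repeat a comparison like this on a current interpreter, something along these lines might work. It is a sketch assuming Python 3, so `imap` is left out; the labels, the `number` value, and the use of `str.isdigit` in place of the lambda are choices made here, not part of the measurements above. Absolute timings will differ by machine and Python version.

```python
import timeit

setup = 'import re; from string import digits; value = "abcd1"'

# Label -> statement to time. The selection is illustrative.
candidates = {
    'any(c.isdigit())': 'any(c.isdigit() for c in value)',
    'any(c in digits)': 'any(c in digits for c in value)',
    're.search': r"re.search(r'\d', value)",
    'any(map())': 'any(map(str.isdigit, value))',
}

results = {}
for label, stmt in candidates.items():
    # number is kept small so the sketch finishes quickly.
    results[label] = timeit.timeit(stmt, setup=setup, number=10000)

# Print fastest first.
for label, seconds in sorted(results.items(), key=lambda kv: kv[1]):
    print(f'{label:20s} {seconds:.4f}s')
```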

edited Jul 11, 2011 at 18:05

answered Jul 11, 2011 at 11:13

Kirk Strauser

2 Comments

Steven Rumbalski Over a year ago

I played with your timings and found that as the value increases in length the regexes perform better and better.

Kirk Strauser Over a year ago

@Steven: That seems reasonable as it shifts a lot of the work to C. I changed value to "abcd" * 1000 + "9" and decreased number to 10000. any(imap) took 7.588s, re.search (stop after first match) took 0.377s, and any(c in digits) took 4.283s. Rewriting the last as any(d in value for d in digits) took only .310s (and .073s when the last digit was "1" instead of "9"), again I suppose because it pushed most of the workload off to the core.
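
The reversed membership test from this comment can be sketched as follows, assuming Python 3; the function name `contains_digit_by_scan` is hypothetical. Instead of one Python-level check per character, it performs at most ten substring searches, each of which runs inside the interpreter's C implementation of `in` for strings.

```python
from string import digits  # '0123456789'

# Hypothetical name, for illustration only.
def contains_digit_by_scan(value):
    # At most ten substring searches; any() stops at the first hit.
    return any(d in value for d in digits)

value = "abcd" * 1000 + "9"
print(contains_digit_by_scan(value))
print(contains_digit_by_scan("abcd"))
```

Because `digits` is tried in order, a string whose only digit is "9" needs all ten searches, while one containing "1" is found on the second, which is consistent with the two timings quoted above.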

Felix Kling · Accepted Answer · 2011-07-11 11:05:18Z

3

You can use a regular expression:

import re
# or if re.search(r'\d', value):
if re.match(r'^.*?\d', value):
    # numbers present
else:
    # no numbers

answered Jul 11, 2011 at 11:05

Felix Kling

4 Comments

Kirk Strauser Over a year ago

You don't need to anchor the expression with ^ and would probably be better off with re.search. Also, .* matches zero or more characters so ? is redundant.

Felix Kling Over a year ago

@Kirk: I thought search might look for all locations that match this pattern (but I need only one), whereas match will only match at the beginning (so yes, I could omit ^ here). I want the shortest possible match, hence the .*? (not greedy).

Kirk Strauser Over a year ago

Good points. After I've slept a little and had some coffee, I'll re-evaluate. :-)

Steven Rumbalski Over a year ago

search gives only the first match.
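
The points discussed in these comments can be checked directly. This is a small sketch assuming Python 3; the variable names are made up for illustration.

```python
import re

value = 'ad.41.bd'

# re.search returns only the first match, found anywhere in the string.
first = re.search(r'\d', value)
print(first.span())

# re.match only tries position 0, so a bare \d finds nothing here.
at_start = re.match(r'\d', value)
print(at_start)

# Non-greedy .*? stops at the first digit it can reach.
shortest = re.match(r'.*?\d', value).group()
print(shortest)

# Greedy .* runs to the end, then backtracks to the last digit.
longest = re.match(r'.*\d', value).group()
print(longest)
```

Both the greedy and the non-greedy form answer the yes/no question identically; they differ only in how much text the match covers.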

Steven Rumbalski · Accepted Answer · 2011-07-11 14:18:37Z

1

if not any(c.isdigit() for c in value):
    # no numbers
else:
    # numbers present

answered Jul 11, 2011 at 14:18

Steven Rumbalski

Darshan Chaudhary · Accepted Answer · 2015-10-06 07:34:31Z

1

To also match a leading minus sign on the numbers, make it optional with the ? operator.

import re
if not re.search(r'-?\d+', value):
    # no numbers
else:
    # numbers present
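
Since the sign is optional, `-?` does not change whether a number is detected; it matters when the matched text is extracted. A short sketch, assuming Python 3 and a made-up sample string:

```python
import re

sample = 'a-41b7'  # made-up input for illustration

# With -? the minus sign becomes part of the match when present.
signed = re.findall(r'-?\d+', sample)
print(signed)

# Without it, only the digit runs are returned.
unsigned = re.findall(r'\d+', sample)
print(unsigned)
```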

answered Oct 6, 2015 at 7:34

Darshan Chaudhary

mrbox · Accepted Answer · 2011-07-11 11:11:57Z

0

If you want to know how big the difference is (that is, how many digits the string contains), you can use re.sub():

import re
digits_num = len(value) - len(re.sub(r'\d','',value))
if not digits_num:
    #without numbers
else:
    #with numbers - or elif digits_num == 3
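
As a possible variation, `re.subn` returns the substitution count directly, so the two `len()` calls are not needed. This is a sketch assuming Python 3; the names `stripped` and `alt_count` are introduced here for illustration.

```python
import re

value = 'ad.41.bd'

# re.subn returns (new_string, number_of_substitutions).
stripped, digits_num = re.subn(r'\d', '', value)
print(stripped, digits_num)

# The same count without a regex: True counts as 1 when summed.
alt_count = sum(c.isdigit() for c in value)
print(alt_count)
```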

answered Jul 11, 2011 at 11:11

mrbox
