Regular expression to filter list of strings matching a pattern

Question

I use R a lot more and it is easier for me to do it in R:

> test <- c('bbb', 'ccc', 'axx', 'xzz', 'xaa')
> test[grepl("^x",test)]
[1] "xzz" "xaa"

But how to do it in python if test is a list?

P.S. I am learning python using google's python exercise and I prefer using regular expression.

Wiktor Stribiżew · Accepted Answer · 2022-01-23 19:40:54Z

In general, you may use

import re                                  # Add the re import declaration to use regex
test = ['bbb', 'ccc', 'axx', 'xzz', 'xaa'] # Define a test list
reg = re.compile(r'^x')                    # Compile the regex
test = list(filter(reg.search, test))      # Create iterator using filter, cast to list 
# => ['xzz', 'xaa']

Or, to inverse the results and get all items that do not match the regex:

list(filter(lambda x: not reg.search(x), test))
# >>> ['bbb', 'ccc', 'axx']

See the Python demo.

USAGE NOTE:

re.search finds the first regex match anywhere in a string and returns a match object, otherwise None
re.match looks for a match only at the string start, it does NOT require a full string match. So, re.search(r'^x', text) = re.match(r'x', text)
re.fullmatch only returns a match if the full string matches the pattern, so, re.fullmatch(r'x') = re.match(r'x\Z') = re.search(r'^x\Z').

If you wonder what the r'' prefix means, see Python - Should I be using string prefix r when looking for a period (full stop or .) using regex? and Python regex - r prefix.

Abhijit · Accepted Answer · 2013-03-14 07:07:10Z

6

You can use the following to find if any of the strings in list starts with 'x'

>>> [e for e in test if e.startswith('x')]
['xzz', 'xaa']
>>> any(e.startswith('x') for e in test)
True

edited Mar 14, 2013 at 7:07

answered Mar 14, 2013 at 7:01

Abhijit

64k20 gold badges143 silver badges209 bronze badges

5 Comments

lokheart Over a year ago

I want to have the string started with "x" to be extracted, but I can't see your answer can give the output I expect.

lokheart Over a year ago

can I use re.match or similar function in re library instead?

squiguy Over a year ago

@lokheart You could definitely use re.match in place of starswith in the list comprehension above.

lokheart Over a year ago

@squiguy tried [x for x in test if re.match("^x",x)] and it works

squiguy Over a year ago

@lokheart Cool :). Have fun with Python!

squiguy · Accepted Answer · 2013-03-14 07:15:51Z

2

You could use filter. I am assuming you want a new list with certain elements from the old one.

new_test = filter(lambda x: x.startswith('x'), test)

Or if you want to use a regular expression in the filter function you could try the following. It requires the re module to be imported.

new_test = filter(lambda s: re.match("^x", s), test)

edited Mar 14, 2013 at 7:15

answered Mar 14, 2013 at 7:04

squiguy

33.7k8 gold badges63 silver badges67 bronze badges

Comments

justadev · Accepted Answer · 2021-02-08 20:45:06Z

1

An example when you want to extract more than one datapoint from each string in the list:

Input:

2021-02-08 20:43:16 [debug] : [RequestsDispatcher@_execute_request] Requesting: https://test.com&uuid=1623\n

Code:

pat = '(.* \d\d:\d\d:\d\d) .*_execute_request\] (.*?):.*uuid=(.*?)[\.\n]'
new_list = [re.findall(pat,s) for s in my_list]

Output:

[[('2021-02-08 20:43:15', 'Requesting', '1623')]]

answered Feb 8, 2021 at 20:45

justadev

1,5464 gold badges25 silver badges46 bronze badges

Comments

user6793824 · Accepted Answer · 2020-08-10 15:17:05Z

0

Here is some improvisation that works fine. Probably helps..

import re
l= ['bbb', 'ccc', 'axx', 'xzz', 'xaa'] #list
s= str( " ".join(l))                   #flattening list to string
re.findall('\\bx\\S*', s)               #regex to find string starting with x

['xzz', 'xaa']

answered Aug 10, 2020 at 15:17

user6793824

1412 silver badges3 bronze badges

Collectives™ on Stack Overflow

Regular expression to filter list of strings matching a pattern

5 Answers 5

Comments

5 Comments

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Comments

5 Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related