Regex execution time analysis

Question

Some of regular expression have exponential time of execution due to bad syntax and non-obvious details. Is there any common way to analyze and learn if some regular expression have linear or exponential execution time?

apparently Regex is evil

uhoh
– uhoh

2018-02-09 06:45:50 +00:00
Commented Feb 9, 2018 at 6:45 — uhoh
– uhoh, Commented Feb 9, 2018 at 6:45
@uhoh, yes - now I know it for sure

user6416335
– user6416335

2018-02-09 08:01:22 +00:00
Commented Feb 9, 2018 at 8:01 — user6416335
– user6416335, Commented Feb 9, 2018 at 8:01
Thanks for the warning, I'm going to avoid it for now. ;-)

uhoh
– uhoh

2018-02-09 08:02:07 +00:00
Commented Feb 9, 2018 at 8:02 — uhoh
– uhoh, Commented Feb 9, 2018 at 8:02

Sobrique · Accepted Answer · 2016-06-08 10:26:44Z

4

I tend to just use perl and switch on use re 'debug'; before doing a regex operation.

This prints the steps the regex is going through to process, and quickly gives an idea of efficiency.

There's no hard and fast rules - the big warning sign I look for is whether this regex will need to backtrack. See: Catastrophic Backtracking

This can happen more easily when you're using lookahead/lookbehind (but doesn't have to).

In the grand scheme of things though - it pays to remember that whilst regex is a programming language, it's starting point is as a power search-and-replace. And thus implementing complicated logic in it, means you're creating code that's hard to maintain and debug - and so you shouldn't.

One of the useful tricks in perl - it can run in much the same way as sed/grep/awk using command line.

So you can enable regex debugging, and then do 'sed style':

perl -pe 's/search/replace' somefile

But then you can add 'debug' regex:

perl -Mre=debug -pe 's/search/replace/' somefile

Which will debug it whilst you're going.

edited Jun 8, 2016 at 10:26

answered Jun 8, 2016 at 9:27

Sobrique

53.6k8 gold badges63 silver badges107 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Lucas Trzesniewski Over a year ago

Catastrophic backtracking is the most serious offender - beware of nested quantifiers.

Cobus Kruger Over a year ago

I think I just discovered a reason to use Perl :)

Collectives™ on Stack Overflow

Regex execution time analysis

1 Answer 1

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related