How to modify this string in Python

Question

I have a string in which I need to add a '\' in front of every '[' or ']', except if the brackets enclose an x like this: '[x]'. In the other cases, the brackets will always enclose a number.

Example: 'Foo[123].bar[x]' should become 'Foo\[123\].bar[x]'.

What is the best way to achieve this? Thanks a lot in advance.

Ah, SO.. where you can get +4 for "give me the codez" these days.
– Wooble, Commented Aug 16, 2012 at 23:11
I learned from the answers below. That's the idea, right? Thanks to all who helped me out. Picking the answer is difficult though. I'll give it to the regex way as I learned more from it.
– saroele, Commented Aug 17, 2012 at 8:08

g.d.d.c · Accepted Answer · 2012-08-16 21:33:40Z

8

Something like this ought to work:

>>> import re
>>>
>>> re.sub(r'\[(\d+)\]', r'\[\1\]', 'Foo[123].bar[x]')
'Foo\\[123\\].bar[x]'
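For use in a script rather than at the interactive prompt, the same substitution could be wrapped in a function with a precompiled pattern. This is only a minimal sketch: the names NUMERIC_INDEX and escape_numeric_brackets are illustrative and do not come from the answer, and the replacement template doubles the backslash, which produces the same output as the template shown above.

```python
import re

# Compiled once at import time; matches a bracketed run of digits such as "[123]".
NUMERIC_INDEX = re.compile(r'\[(\d+)\]')

def escape_numeric_brackets(s):
    """Put a backslash before the brackets around numeric indexes only."""
    # r'\\[' in the template yields a literal backslash followed by '['.
    return NUMERIC_INDEX.sub(r'\\[\1\\]', s)

result = escape_numeric_brackets('Foo[123].bar[x]')
print(result)  # Foo\[123\].bar[x]
```

The interactive session above shows doubled backslashes because the prompt displays the repr of the string; print shows the single backslashes that the string actually contains.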


3 Comments

dsign Over a year ago

This is a good solution (just compile the regex beforehand). It may be slightly slower than three chained replaces, because a regex engine carries some overhead. In any case, this solution is easier to understand and would be easier to modify if the requirements change in the future.

saroele Over a year ago

I'm a regex newbie, but with a little help from the web I could figure out why this also works. Thanks, I see it's worth learning a bit more about regex.

Honest Abe Over a year ago

@saroele I recommend this tutorial.

cmh · Accepted Answer · 2012-08-16 21:33:57Z

6

You can do it without reaching for regexes, like this:

s.replace('[', '\\[').replace(']', '\\]').replace('\\[x\\]', '[x]')


7 Comments

g.d.d.c Over a year ago

Three full scans of the result string to achieve the desired output? For long strings this will perform poorly.

cmh Over a year ago

Sure, and in that case I'd advocate the regex approach. I was presenting (arguably) a conceptually simpler approach for when this isn't an issue. If performance is an issue, a single linear scan (with one lookahead) would outperform regex anyway.
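The single linear scan with one lookahead mentioned in this comment might look roughly like the sketch below. The function name escape_brackets_single_pass is hypothetical, and the sketch only illustrates the idea. It does not show that a pure-Python loop is faster than re.sub or chained replaces in CPython, so that would need to be measured.

```python
def escape_brackets_single_pass(s):
    """Escape every '[' and ']' except those forming the literal '[x]'."""
    out = []
    i = 0
    n = len(s)
    while i < n:
        if s.startswith('[x]', i):
            # Lookahead hit: copy the protected token unchanged and skip past it.
            out.append('[x]')
            i += 3
        elif s[i] in '[]':
            # Any other bracket gets a backslash in front of it.
            out.append('\\' + s[i])
            i += 1
        else:
            out.append(s[i])
            i += 1
    return ''.join(out)

print(escape_brackets_single_pass('Foo[123].bar[x]'))  # Foo\[123\].bar[x]
```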

DSM Over a year ago

On the test string 'Foo[123].bar[x]zzzzz'*100000, 2M characters long, I find that re.sub takes 250 ms (even making sure the pattern is compiled) whereas the replace only takes 36 ms.
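A comparison along these lines can be reproduced with timeit. The sketch below is one possible setup, not the code used for the figures quoted above: the helper names with_regex and with_replace are made up for illustration, and the timings will vary with the machine and Python version.

```python
import re
import timeit

PATTERN = re.compile(r'\[(\d+)\]')

def with_regex(s):
    # Precompiled pattern, so compilation cost is not part of the timing.
    return PATTERN.sub(r'\\[\1\\]', s)

def with_replace(s):
    # Three chained str.replace calls, each scanning the whole string.
    return s.replace('[', '\\[').replace(']', '\\]').replace('\\[x\\]', '[x]')

test_string = 'Foo[123].bar[x]zzzzz' * 100000  # 2M characters

# Both approaches should agree before their timings are compared.
assert with_regex(test_string) == with_replace(test_string)

for func in (with_regex, with_replace):
    seconds = timeit.timeit(lambda: func(test_string), number=5) / 5
    print(f'{func.__name__}: {seconds * 1000:.1f} ms per call')
```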

Steven Rumbalski Over a year ago

@g.d.d.c: It's O(n). The regex is probably the best tool for the job, but this is acceptable.

saroele Over a year ago

The strings I need to convert are at most 100 characters long, so it seems this solution will be the fastest.


FailedDev · Accepted Answer · 2012-08-16 21:38:09Z

A different approach: put a backslash before a [ only if it isn't followed by x], and before a ] only if it isn't preceded by [x.

result = re.sub(r"(\[(?!x\])|(?<!\[x)\])", r"\\\1", subject)

Explanation:

# (\[(?!x\])|(?<!\[x)\])
# 
# Match the regular expression below and capture its match into backreference number 1 «(\[(?!x\])|(?<!\[x)\])»
#    Match either the regular expression below (attempting the next alternative only if this one fails) «\[(?!x\])»
#       Match the character “[” literally «\[»
#       Assert that it is impossible to match the regex below starting at this position (negative lookahead) «(?!x\])»
#          Match the character “x” literally «x»
#          Match the character “]” literally «\]»
#    Or match regular expression number 2 below (the entire group fails if this one fails to match) «(?<!\[x)\]»
#       Assert that it is impossible to match the regex below with the match ending at this position (negative lookbehind) «(?<!\[x)»
#          Match the character “[” literally «\[»
#          Match the character “x” literally «x»
#       Match the character “]” literally «\]»
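To see how this pattern behaves next to the digits-only pattern from the other regex answer, a small comparison can help. In the sketch below the function names and the '[abc]' part of the sample are hypothetical (the question states that brackets other than '[x]' always enclose a number). The difference only appears for such non-numeric content: the lookaround version escapes every bracket that is not part of '[x]', while the digits-only version escapes numeric indexes alone.

```python
import re

LOOKAROUND = re.compile(r'(\[(?!x\])|(?<!\[x)\])')
DIGITS_ONLY = re.compile(r'\[(\d+)\]')

def escape_with_lookaround(s):
    # r'\\\1' is a literal backslash followed by the captured bracket.
    return LOOKAROUND.sub(r'\\\1', s)

def escape_digits_only(s):
    return DIGITS_ONLY.sub(r'\\[\1\\]', s)

sample = 'Foo[123].bar[x].baz[abc]'  # '[abc]' is a made-up edge case
print(escape_with_lookaround(sample))  # Foo\[123\].bar[x].baz\[abc\]
print(escape_digits_only(sample))      # Foo\[123\].bar[x].baz[abc]
```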
