Replacing using regex in Python

Question

I have a sentence like this:

s = " zero/NN  divided/VBD  by/IN  anything/NN is zero/NN"

I need to replace every word/tag pair with just the tag. The output should be:

s = "NN VBD IN NN is NN"

I tried a regex replacement like this:

tup = re.sub( r"\s*/$" , "", s)

but it does not give me the correct output. Please help.

stema · Accepted Answer · 2012-02-03 07:14:28Z

3

This gives the output you want:

tup = re.sub( r"\b\w+/" , "", s)

\b matches a word boundary, followed by \w+, which matches at least one word character (a-zA-Z0-9_), and finally the slash.
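As a quick check of the pattern above, here is a minimal sketch run against the sentence from the question. The names tags and normalized are only illustrative, and the whitespace cleanup is an extra step that is not part of the answer itself: the substitution leaves the original spacing alone, so the double spaces and the leading space survive unless they are collapsed separately.

```python
import re

s = " zero/NN  divided/VBD  by/IN  anything/NN is zero/NN"

# Remove each "word/" prefix, leaving only the tag after the slash.
tags = re.sub(r"\b\w+/", "", s)
print(repr(tags))  # ' NN  VBD  IN  NN is NN'

# The substitution does not touch whitespace, so collapse runs of
# spaces separately to get the exact string asked for in the question.
normalized = " ".join(tags.split())
print(normalized)  # NN VBD IN NN is NN
```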

avasal · Accepted Answer · 2012-02-03 07:07:38Z

2

Try:

tup = re.sub( r"[a-z]*/" , "", s)

In [1]: s = " zero/NN divided/VBD by/IN anything/NN is zero/NN"
In [2]: tup = re.sub( r"[a-z]*/" , "", s)
In [3]: print(tup)
 NN VBD IN NN is NN

1 Comment

dheeraj Over a year ago

This is pretty much the same as the first answer, but the character class can be changed to [A-Za-z] so that uppercase letters are matched as well.
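A small sketch of why the class is safer written as [A-Za-z] than as the shorter [A-z]: in ASCII, six punctuation characters sit between Z and a, so the range A-z matches them too. The variable names and the capitalized sample string below are made up for illustration.

```python
import re

# The six ASCII characters that sit between "Z" and "a".
between = "[\\]^_`"

# [A-z] spans the whole code-point range, so it matches all of them.
loose = re.findall(r"[A-z]", between)
print(loose)   # ['[', '\\', ']', '^', '_', '`']

# [A-Za-z] matches letters only.
strict = re.findall(r"[A-Za-z]", between)
print(strict)  # []

# With the letters-only class, capitalized words are stripped as well.
sample = " Zero/NN divided/VBD"
print(repr(re.sub(r"[A-Za-z]*/", "", sample)))  # ' NN VBD'
```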

Lukáš Lalinský · Accepted Answer · 2012-02-03 07:07:18Z

0

The \s character class matches whitespace characters, which doesn't seem to be what you want; in addition, /$ only matches a slash at the very end of the string. I think you want the opposite case, all non-whitespace characters. You can also be more specific about what counts as a tag, for example:

tup = re.sub( r"\S+/([A-Z]+)" , r"\1", s)

This replaces each run of non-whitespace characters followed by a slash and a sequence of uppercase letters with just the uppercase letters.
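To make the difference concrete, here is a minimal sketch comparing the attempt from the question with the \S+ pattern. The hyphenated token is a made-up example, not taken from the question, included to show where \S+ and \w+ behave differently.

```python
import re

s = " zero/NN  divided/VBD  by/IN  anything/NN is zero/NN"

# The attempt from the question needs a "/" at the very end of the
# string, so nothing matches and the input comes back unchanged.
unchanged = re.sub(r"\s*/$", "", s)
print(unchanged == s)  # True

# \S+ consumes the whole token before the slash; \1 keeps the tag.
tags = re.sub(r"\S+/([A-Z]+)", r"\1", s)
print(repr(tags))      # ' NN  VBD  IN  NN is NN'

# Hypothetical token containing a hyphen: \w+ stops at the hyphen,
# while \S+ takes everything up to the slash.
token = "well-known/JJ"
print(re.sub(r"\b\w+/", "", token))             # well-JJ
print(re.sub(r"\S+/([A-Z]+)", r"\1", token))    # JJ
```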

stew · Accepted Answer · 2012-02-03 07:09:58Z

0

tup = re.sub( r"\b\w+/(\w+)\b", r"\1", s)

On either side of my regex is \b, meaning "word boundary"; on either side of the "/" I have \w+, meaning one or more "word characters". On the right, we capture them as a group by putting them in parentheses.

The second argument, r"\1", means "the first group", which inserts whatever the parentheses matched.
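The group reference can be inspected directly. The sketch below looks at the first match to show what group 1 holds, then runs the substitution. The named-group variant at the end is an equivalent spelling added here for illustration, not something from the answer; (?P<tag>...) and \g<tag> are standard syntax in Python's re module.

```python
import re

s = " zero/NN  divided/VBD  by/IN  anything/NN is zero/NN"
pattern = re.compile(r"\b\w+/(\w+)\b")

# Look at the first match: group(0) is the whole match,
# group(1) is the part captured by the parentheses.
first = pattern.search(s)
print(first.group(0))  # zero/NN
print(first.group(1))  # NN

# r"\1" in the replacement inserts group 1 for every match.
result = pattern.sub(r"\1", s)
print(repr(result))    # ' NN  VBD  IN  NN is NN'

# Same substitution with a named group instead of a numbered one.
named = re.sub(r"\b\w+/(?P<tag>\w+)\b", r"\g<tag>", s)
print(named == result)  # True
```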
