how to regex a value of a specific key in python

Question

I have a long string with key values in this format:

"info":"infotext","day":"today","12":"here","info":"infotext2","info":"infotext3"

I want to get the value (=infotexts) of all "info" keys. How can this be done?

This data looks like your previous post - Are you now trying to parse that data without json - if so, why? — Jon Clements
– Jon Clements, Commented Nov 17, 2012 at 20:06

georg · Accepted Answer · 2012-11-17 20:00:34Z

4

Use the json, Luke

s = '"info":"infotext","day":"today","12":"here","info":"infotext2","info":"infotext3"'

import json

def pairs_hook(pairs):
    return [val for key, val in pairs if key == 'info']

p = json.loads('{' + s + '}', object_pairs_hook=pairs_hook)
print p # [u'infotext', u'infotext2', u'infotext3']

From the docs:

object_pairs_hook is an optional function that will be called with the result of any object literal decoded with an ordered list of pairs. The return value of object_pairs_hook will be used instead of the dict.

Just for the sake of completeness, here's a regular expression that does the same:

rg = r'''(?x)

    "info"
    \s* : \s*
    "
        (
            (?:\\.|[^"])*
        )
    "
'''
re.findall(rg, s) # ['infotext', 'infotext2', 'infotext3']

This also handles spaces around : and escaped quotes inside strings, like e.g.

 "info"  :   "some \"interesting\" information"

edited Nov 17, 2012 at 20:00

answered Nov 17, 2012 at 19:41

georg

216k57 gold badges324 silver badges401 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Martin Ender · Accepted Answer · 2012-11-17 19:37:54Z

3

As long as your infotext does not contain (escaped) quotes, you could try something like this:

>>> m = re.findall(r'"info":"([^"]+)', str)
>>> m
['infotext', 'infotext2', 'infotext3']

We simply match "info":" and then as many non-" characters as possible (which are captured and thus returned).

answered Nov 17, 2012 at 19:37

Martin Ender

44.4k11 gold badges93 silver badges132 bronze badges

Comments

burning_LEGION · Accepted Answer · 2012-11-17 19:38:41Z

0

use this regex (?<="info":")(.+?)(?=")

answered Nov 17, 2012 at 19:38

burning_LEGION

13.5k8 gold badges42 silver badges52 bronze badges

Comments

Ashwini Chaudhary · Accepted Answer · 2012-11-17 19:40:55Z

0

In [140]: import re

In [141]: strs='''"info":"infotext","day":"today","12":"here","info":"infotext2","info":"infotext3"'''

In [146]: [x.split(":")[-1].strip('"') for x in  re.findall(r'"info":"\w+"',strs)]
Out[146]: ['infotext', 'infotext2', 'infotext3']

answered Nov 17, 2012 at 19:40

Ashwini Chaudhary

252k60 gold badges478 silver badges519 bronze badges

Collectives™ on Stack Overflow

how to regex a value of a specific key in python

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related