Dealing with columns containing nested JSON using Python pandas

Question

I have a pandas df with various columns. One column - myCol - looks like this:

df

someCol   myCol
a         [{}]
b         [{'X': {'A': "value", 'B': "value"}}]
c         [{}, {}]
d         [{'X': {'A': "value", 'B': "value", 'C': "value"}}]

The maximum number of key-value pairs in X is unknown: some rows contain them all, some contain only a subset, and some are empty. I would like to replace myCol with actual columns, one for each unique key that appears in X. So in this particular example, I would end up with:

df

someCol   A       B       C
a         N/A     N/A     N/A
b         value   value   N/A     
c         N/A     N/A     N/A
d         value   value   value

I am struggling to come up with a general way to solve this, which I need because I don't know in advance how many additional columns there will be. Any ideas would be much appreciated.

Hi, please check out pandas.read_json: pandas.pydata.org/pandas-docs/stable/reference/api/… and edit your question if you still need help.
– eva-vw, Commented Jan 27, 2020 at 14:14

jezrael · Accepted Answer · 2020-01-27 14:16:26Z

2

This solution first takes the first element of each list and looks up the dictionary stored under key X, then converts the resulting None values to empty dicts, and finally passes the result to the DataFrame constructor:

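# Take the first list element, look up key 'X', and replace missing results with {}
d = [{} if pd.isna(x) else x for x in df.pop('myCol').str[0].str.get('X')]
# Build one column per key and join the result back on the original index
df = df.join(pd.DataFrame(d, index=df.index))
print(df)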
  someCol      A      B      C
0       a    NaN    NaN    NaN
1       b  value  value    NaN
2       c    NaN    NaN    NaN
3       d  value  value  value
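
For anyone who wants to run this end to end, here is a minimal self-contained sketch of the same approach. The frame is rebuilt from the example in the question, and the helper name expand_first_x and its parameters are hypothetical, introduced only for illustration. It assumes, like the code above, that only the first element of each list matters and that the nested dict sits under key X.

```python
import pandas as pd

# Rebuild the example frame from the question ("value" is a placeholder).
df = pd.DataFrame({
    "someCol": ["a", "b", "c", "d"],
    "myCol": [
        [{}],
        [{"X": {"A": "value", "B": "value"}}],
        [{}, {}],
        [{"X": {"A": "value", "B": "value", "C": "value"}}],
    ],
})


def expand_first_x(frame, col="myCol", key="X"):
    """Hypothetical helper: replace `col` with one column per key found under `key`."""
    out = frame.copy()  # leave the caller's frame untouched
    # .str[0] takes the first list element; .str.get(key) looks the key up in that dict.
    inner = out.pop(col).str[0].str.get(key)
    # Rows without the key come back as missing, so substitute an empty dict.
    records = [x if isinstance(x, dict) else {} for x in inner]
    # The DataFrame constructor takes the union of all keys as columns.
    return out.join(pd.DataFrame(records, index=out.index))


result = expand_first_x(df)
print(result)
```

Because the DataFrame constructor builds its columns from the union of the keys it sees, the number of new columns does not have to be known in advance. Working on a copy is a design choice of this sketch; the code above modifies df in place via pop.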


3 Comments

CHRD Over a year ago

Thank you. If the values are True or False instead of "value", how can I adapt your answer to that?

jezrael Over a year ago

@CHRD - I think no change is needed.

CHRD Over a year ago

True, that didn't matter. Thanks a lot!
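
As a quick check of the point in this exchange, here is a small sketch that runs the same two lines on boolean values. The two-row frame is made up for illustration. The last step is an optional extra that is not part of the answer: since a column mixing missing values with True/False typically ends up with object dtype, it can be converted to pandas' nullable "boolean" dtype if a typed column is wanted.

```python
import pandas as pd

# Made-up two-row frame with boolean values instead of "value".
df = pd.DataFrame({
    "someCol": ["a", "b"],
    "myCol": [[{}], [{"X": {"A": True, "B": False}}]],
})

# Same two lines as in the answer, unchanged.
d = [{} if pd.isna(x) else x for x in df.pop("myCol").str[0].str.get("X")]
out = df.join(pd.DataFrame(d, index=df.index))
print(out)

# Optional: convert the new columns to the nullable boolean dtype.
typed = out.astype({"A": "boolean", "B": "boolean"})
print(typed.dtypes)
```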
