How to split every string in a list in a dataframe column

Question

I have a dataframe with a column in which each value is a list of strings of the form 'A:B'. I'd like to add a new column that contains, for each row, the set of first elements obtained by splitting each string on ':'.

import pandas as pd

data = [
    {'Name': 'A', 'Servers': ['A:s1', 'B:s2', 'C:s3', 'C:s2']},
    {'Name': 'B', 'Servers': ['B:s1', 'C:s2', 'B:s3', 'A:s2']},
    {'Name': 'C', 'Servers': ['G:s1', 'X:s2', 'Y:s3']}
]

df = pd.DataFrame(data)
df

# Desired result: a new column holding the set of prefixes before ':'
df['Clusters'] = [
    {'A', 'B', 'C'},
    {'B', 'C', 'A'},
    {'G', 'X', 'Y'}
]

What do you want the results to look like? What have you tried?
– piRSquared, Commented Jul 4, 2019 at 20:56
It should be the same dataframe with a 'Clusters' column added, where each value is the set of first elements obtained by splitting each 'Servers' entry at ':'.
– Evan Brittain, Commented Jul 4, 2019 at 21:02

Karthik V · Accepted Answer · 2019-07-04 21:02:28Z

Use apply on the 'Servers' column with a set comprehension that keeps the part of each string before the ':':

  In [5]: df['Clusters'] = df['Servers'].apply(lambda x: {p.split(':')[0] for p in x})

  In [6]: df
  Out[6]:
    Name                   Servers   Clusters
  0    A  [A:s1, B:s2, C:s3, C:s2]  {A, B, C}
  1    B  [B:s1, C:s2, B:s3, A:s2]  {C, B, A}
  2    C        [G:s1, X:s2, Y:s3]  {X, Y, G}
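
For anyone who wants to run this outside an IPython session, here is a minimal self-contained sketch of the same idea. The helper name cluster_names is made up for illustration, and str.partition is swapped in for split so that an entry without a ':' is returned whole (an assumption about how such entries should be handled; the question does not say). The second variant uses Series.explode, which assumes pandas 0.25 or newer. Sets are unordered, so the printed element order may differ from the output above.

```python
import pandas as pd

data = [
    {'Name': 'A', 'Servers': ['A:s1', 'B:s2', 'C:s3', 'C:s2']},
    {'Name': 'B', 'Servers': ['B:s1', 'C:s2', 'B:s3', 'A:s2']},
    {'Name': 'C', 'Servers': ['G:s1', 'X:s2', 'Y:s3']},
]
df = pd.DataFrame(data)


def cluster_names(servers):
    """Hypothetical helper: set of prefixes before the first ':' in each string."""
    # partition(':')[0] is the text before the first ':'
    # (or the whole string when there is no ':').
    return {s.partition(':')[0] for s in servers}


# Same approach as the answer: apply a function to every list in the column.
df['Clusters'] = df['Servers'].apply(cluster_names)

# Alternative sketch: flatten the lists to one row per string, take the
# prefix, then regroup by the original row index and collect into sets.
prefixes = df['Servers'].explode().str.split(':').str[0]
clusters_alt = prefixes.groupby(level=0).apply(set)

print(df)
```

The apply version is usually the simpler choice for a column of short lists; the explode version may be worth a look if you also need the flattened prefixes for something else.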
