How to append strings inside dataframe cells based on column values

Question

Given a dataframe:

import pandas as pd

df = pd.DataFrame(data= {'Col1': ['No', 'Yes', 'No', 'Maybe'], 'Col2': ['Yes', 'No', 'No', 'No'], 'Result': ''})

I want to populate Result with a list that may need to be appended based upon a column value. In this case, the parameters would be:

If the value is 'Yes' keep the current value of Result, if the value is 'Maybe' append 'Attention needed (insert column name)', if the value is 'No' append 'Failure (insert column name)'

Desired result:

What is the issue, exactly? Have you tried anything, done any research? Stack Overflow is not a free code writing service. See: How to Ask, help center, meta.stackoverflow.com/questions/261592/…. — AMC
– AMC, Commented Mar 11, 2020 at 21:12
Which part are you struggling with? Also, you wrote I want to populate Result with a list that may need to be appended based upon a column value, but it looks like you're doing so based on multiple columns. Is that what your data actually looks like? Can you provide some context for this? — AMC
– AMC, Commented Mar 11, 2020 at 21:26
@AMC I am not trying to append strings to a series that results in a string. I need to append strings to a series that results in a list within each cell, if that is even possible. — Trace R.
– Trace R., Commented Mar 11, 2020 at 21:28
Yes, I had misread the data, it's less than ideal but it's certainly possible. — AMC
– AMC, Commented Mar 11, 2020 at 21:28

Chris Adams · Accepted Answer · 2020-03-11 21:16:15Z

1

Not very pretty, but you could create a dict, then use stack, map and groupby with join aggregation:

d = {'No': 'Failure', 'Maybe': 'Attention needed'}
s = df[['Col1', 'Col2']].stack().map(d).dropna()

df['Result'] = (s + ' ' + s.index.get_level_values(1)).groupby(level=0).agg(', '.join)

[out]

    Col1 Col2                               Result
0     No  Yes                         Failure Col1
1    Yes   No                         Failure Col2
2     No   No           Failure Col1, Failure Col2
3  Maybe   No  Attention needed Col1, Failure Col2

answered Mar 11, 2020 at 21:16

Chris Adams

18.7k4 gold badges26 silver badges44 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Shadab Hussain · Accepted Answer · 2020-03-11 21:31:30Z

1

Try this one liner code using lambda function:

df['Result'] = df[['Col1','Col2']].apply(lambda x: 'Failure Col1' if (x[0]=='No' and x[1]=='Yes') else ('Failure Col2' if (x[1]=='No' and x[0]=='Yes') else ('Failure Col1, Failure Col2' if (x[0]=='No' and x[1]=='No') else("Attention needed Col1, Failure Col2" if (x[0]=='Maybe' and x[1]=='No') else None))), axis=1)

Output:


   Col1     Col2    Result
0   No      Yes     Failure Col1
1   Yes     No      Failure Col2
2   No      No      Failure Col1, Failure Col2
3   Maybe   No      Attention needed Col1, Failure Col2

edited Mar 11, 2020 at 21:31

answered Mar 11, 2020 at 21:21

Shadab Hussain

8348 silver badges26 bronze badges

Comments

Meto · Accepted Answer · 2020-03-11 21:03:52Z

0

You may first construct the result column as a numpy array while traversing the data frame columns and checking the values then you can add the new result column and drop the old one.

answered Mar 11, 2020 at 21:03

Meto

6568 silver badges19 bronze badges

Comments

Andy L. · Accepted Answer · 2020-03-11 22:50:23Z

Construct a dictionary to replace values in df and Using * and + to construct a series of appropriate message strings and finally join them and assign to df.Result

d = {'Yes': '', 'No': 'Failure ', 'Maybe': 'Attention needed '}
df1 = df[['Col1', 'Col2']]
df['Result'] = ((df1.replace(d) 
                + df1.ne('Yes').values * df1.columns.values).agg(','.join, axis=1)
                                                            .str.strip(','))

Or

df['Result'] = ((df1.replace(d) 
                + df1.ne('Yes').values * (df1.columns+',').values).sum(1)
                                                                  .str.strip(','))

Out[267]:
    Col1 Col2                              Result
0     No  Yes                        Failure Col1
1    Yes   No                        Failure Col2
2     No   No           Failure Col1,Failure Col2
3  Maybe   No  Attention needed Col1,Failure Col2

Here the detail

df1.replace(d) + df1.ne('Yes').values * df1.columns.values

Out[268]:
                    Col1          Col2
0           Failure Col1
1                         Failure Col2
2           Failure Col1  Failure Col2
3  Attention needed Col1  Failure Col2

((df1.replace(d) + df1.ne('Yes').values * df1.columns.values).agg(','.join, axis=1)
                                                             .str.strip(','))

Out[269]:
0                          Failure Col1
1                          Failure Col2
2             Failure Col1,Failure Col2
3    Attention needed Col1,Failure Col2
dtype: object

Collectives™ on Stack Overflow

How to append strings inside dataframe cells based on column values

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related