Update row values in a dataframe based on another row's values?

Question

I have a dataframe with two columns: a and b

df

         a         b
0       john      123
1       john
2       mark     
3       mark      456
4       marcus    789

I want to update values of b column based on a column.

         a         b
0       john      123
1       john      123
2       mark      456
3       mark      456
4       marcus    789

If john has value 123 in b. Remaining john also must have same value.

Shubham Sharma · Accepted Answer · 2020-05-21 10:25:02Z

1

Assuming your dataframe is:

df = pd.DataFrame({'a': ['john', 'john', 'mark', 'mark', 'marcus'], 'b': [123, '', '', 456, 789]})

You can df.groupby the dataframe on column a and then apply transform on the column b of the grouped dataframe returning the first non empty value in the grouped column b.

Use:

df['b'] = (
    df.groupby('a')['b']
    .transform(lambda s: s[s.ne('')].iloc[0] if s.ne('').any() else s)
)

Result:

# print(df)

        a    b
0    john  123
1    john  123
2    mark  456
3    mark  456
4  marcus  789

edited May 21, 2020 at 10:25

answered May 21, 2020 at 10:10

Shubham Sharma

71.8k6 gold badges26 silver badges58 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

user11509999 Over a year ago

Thanks. But if last row's 'b' value Is empty. marcus is ` `. Then what's should be the case

Shubham Sharma Over a year ago

@Zanthoxylumpiperitum Well in that case i have to modify the answer.

Shubham Sharma Over a year ago

@Zanthoxylumpiperitum What value do you expect in column b in such case?

user11509999 Over a year ago

Same as the original value

Shubham Sharma Over a year ago

@Zanthoxylumpiperitum Check the edited answer. Now this should work in such case.

PYPA_AI · Accepted Answer · 2020-05-21 10:03:58Z

0

Example:

df = pd.DataFrame({'A': [0," ", 2, 3, 4],
                   'B': [5, 6, 7, 8, 9],
                   'C': ['a', 'b', 'c', 'd', 'e']})

df1=df.replace({'A':" "},3)

Hope this helps, In your case it would be like

df1=df.replace({'b':" "},123)

answered May 21, 2020 at 10:03

PYPA_AI

391 silver badge8 bronze badges

Collectives™ on Stack Overflow

Update row values in a dataframe based on another row's values?

2 Answers 2

5 Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

5 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related