Pandas apply function to each row by calculating multiple columns

Question

I have been stacked by an easy question, and my question title might be inappropriate.

df = pd.DataFrame(list(zip(['a', 'a', 'b', 'b', 'c', 'c', 'c'], 
                           ['a1', 'a2', 'b1', 'b2', 'c1', 'c2', 'c3'],
                           [110, 80, 100, 180, 12], 
                           [5, 7, 2, 6, 10])), 
                      columns=['name', 'ingredient', 'amount', 'con'])

I want to calculate (df.amount * df.con)/df.groupby('name').agg({'amount':'sum'}).reset_index().loc(df.name==i).amount) (Sorry, this line will return error, but what I want is to calculate total concentration (under each name) based on each ingredient amount and ingredient con.

Here is my code:

df['cal'] =df.amount * df.con
df = df.merge(df.groupby('name').agg({'amount':'sum'}).reset_index(),
              on = ['name'], how = 'left', suffixes = (None, '_y'))
df['what_i_want'] = df['cal']/df['amount_y']
df.groupby('name').what_i_want.sum()

output:

name
a     5.842105
b     4.571429
c    10.000000
Name: what_i_want, dtype: float64

Any short-cut for this calculation?

Thanks ahead.

mozway · Accepted Answer · 2022-06-11 16:34:03Z

2

IIUC, you can use:

out = (df
 .groupby('name')
 .apply(lambda g: g['amount'].mul(g['con']).sum()/g['amount'].sum())
)

output:

name
a     5.842105
b     4.571429
c    10.000000
dtype: float64

answered Jun 11, 2022 at 16:34

mozway

267k13 gold badges55 silver badges106 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

SultanOrazbayev · Accepted Answer · 2022-06-11 16:33:20Z

1

To shortcut the operations (esp. remove the merge), you can use groupy.transform, which will retain the original index:

df["what_i_want_2"] = (df["amount"] * df["con"]) / (
    df.groupby("name")["amount"].transform("sum")
)

answered Jun 11, 2022 at 16:33

SultanOrazbayev

16.7k3 gold badges24 silver badges59 bronze badges

Collectives™ on Stack Overflow

Pandas apply function to each row by calculating multiple columns

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related