
Let's say I am trying to find how many duplicates I have for a pair of values in a table. The columns are "A" and "B". I can do

select A, B, count(*) as counter from table group by A, B

In fact, I could also do

select A, B from (select A, B, count(*) as counter from table group by A, B) t where counter >= 2

to deal only with pairs that occur at least n times.
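For what it's worth, the subquery can also be avoided with HAVING, which filters groups directly. A minimal sketch run against an in-memory SQLite table (the table name t and the sample rows are assumptions for illustration):

```python
import sqlite3

# Hypothetical setup: a tiny table "t" with the two columns of interest
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (A TEXT, B TEXT)")
conn.executemany("INSERT INTO t VALUES (?, ?)",
                 [("x", "a"), ("x", "a"), ("x", "b"), ("y", "b"), ("y", "a")])

# HAVING filters the grouped rows, so no derived table is needed
rows = conn.execute(
    "SELECT A, B, COUNT(*) AS counter FROM t "
    "GROUP BY A, B HAVING counter >= 2"
).fetchall()
print(rows)  # [('x', 'a', 2)]
```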

How can I do the same in pandas?

I can do

df.groupby(["A", "B"]).count()

but that gives me every group; I only want those where the count >= 2.

For example if I have:

   A  B  C
0  x  a  1
1  x  a  1
2  x  b  2
3  y  b  3
4  y  a  1

I want to identify the first two rows, because groupby() gives a count of 2 (the pair (x, a) is repeated). I would like to do the same for any count n, not just 2.
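For reference, the frame above can be rebuilt and the per-pair counts inspected with size(), which counts rows per group regardless of NaNs (a sketch; the filter threshold 2 is just the example from the question):

```python
import pandas as pd

# Rebuild the sample frame from the question
df = pd.DataFrame({"A": ["x", "x", "x", "y", "y"],
                   "B": ["a", "a", "b", "b", "a"],
                   "C": [1, 1, 2, 3, 1]})

counts = df.groupby(["A", "B"]).size()  # one count per (A, B) pair
print(counts[counts >= 2])              # only (x, a) appears twice
```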

  • Can you show us some sample data? Commented Apr 25, 2019 at 2:32

1 Answer


It seems you can filter after groupby:

df.groupby(["A", "B"])['A'].count().loc[lambda x : x >= 2]
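Applied to the sample frame from the question, a variant of that filter (with the threshold written as >= 2 to match the question) keeps only the repeated pairs:

```python
import pandas as pd

df = pd.DataFrame({"A": ["x", "x", "x", "y", "y"],
                   "B": ["a", "a", "b", "b", "a"],
                   "C": [1, 1, 2, 3, 1]})

# count() the groups, then filter the resulting Series by its own values
dupes = df.groupby(["A", "B"])["A"].count().loc[lambda x: x >= 2]
print(dupes.index.tolist())  # [('x', 'a')]
```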

Update: use duplicated

df[df.duplicated(['A','B'],keep=False)]
Out[1178]: 
   A  B  C
0  x  a  1
1  x  a  1

Use transform for a different n:

n=2

df[df.groupby(['A','B'])['A'].transform('count')==n]
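Run against the sample data, the transform approach returns the duplicated rows themselves rather than the group counts (a sketch with n = 2, as in the answer):

```python
import pandas as pd

df = pd.DataFrame({"A": ["x", "x", "x", "y", "y"],
                   "B": ["a", "a", "b", "b", "a"],
                   "C": [1, 1, 2, 3, 1]})

n = 2
# transform broadcasts each group's count back onto its rows,
# so the resulting mask lines up with df for boolean indexing
result = df[df.groupby(["A", "B"])["A"].transform("count") == n]
print(result)  # rows 0 and 1, the repeated (x, a) pair
```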

6 Comments

What if I want the count = 3, or something larger?
Thank you! Why do I need ['A'] before .transform? What does that do? Can I put any column?
@user that is just required for count; you need something to count.
So, I guess there is no option like * in SQL? What if my column had a null in it?
@user if you need to count all columns you can do df.groupby(['A','B']).transform('count')
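On the null question: count skips NaN, so transform('count') can give a smaller number than the group's row count; transform('size') counts rows like SQL's count(*). A sketch with made-up data containing one NaN:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"A": ["x", "x", "x", "y"],
                   "B": ["a", "a", "a", "b"],
                   "C": [1.0, np.nan, 2.0, 3.0]})

# The (x, a) group has 3 rows but only 2 non-null C values
print(df.groupby(["A", "B"])["C"].transform("count").tolist())  # non-null counts
print(df.groupby(["A", "B"])["C"].transform("size").tolist())   # row counts
```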
