Filter a dataframe using two conditions in python

Question

I want to filter a dataframe using two different condition.

But I want to omit rows which doesn't satisfy the condition and only want to keep values which occur at least twice in column A

df1 = df[(df['A-B occurrence'] >= 3) & (df['A occurrence'] >= 2)]

Above is the code I am using and this is the output I get:

So as in column A, 17 is satisfying condition in one row only so I want to omit 17 all together as it is not meeting the condition, which means I only want to keep duplicate values which are present in column A 2 or more than 2 times

"so as in coloumn A, 17 is satisfying condition in one row only" is not True. 'A occurrence' >= 2) — It_is_Chris
– It_is_Chris, Commented Oct 28, 2021 at 14:09
I don't fully understand what you mean. Do you want an 'or' statement rather than an `and'? — Quixotic22
– Quixotic22, Commented Oct 28, 2021 at 14:10
I think OP wants to keep only the duplicated As (see my answer) — mozway
– mozway, Commented Oct 28, 2021 at 14:13

mozway · Accepted Answer · 2021-10-28 14:11:20Z

1

IIUC you want to keep only the rows for which A has duplicates.

You can use:

df2 = df1[df1['A'].duplicated(keep=False)]

output: this should remove rows with index 14 (A=17) and 19 (A=19)

NB. you can apply the same strategy on the other columns if needed

answered Oct 28, 2021 at 14:11

mozway

267k13 gold badges56 silver badges106 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Filter a dataframe using two conditions in python

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related