Create new Pandas boolean df based on values from list

Question

Suppose I have this df:

col1 col2 col3 col4
A     B     B    A
B     C     C    D
D    null   D   null

And a list

list1 = ["A","B","C","D"]

How do I create a new df with the boolean representation of the values of the list as first column if the value is in the old df columns?

Expected output:

list1 col1 col2 col3 col4
  A    1    0    0    1
  B    1    1    1    0
  C    0    1    1    0
  D    1    0    1    1

Georgina Skibinski · Accepted Answer · 2020-10-05 21:00:33Z

1

Try:

res = pd.DataFrame(index=list1, columns=df.columns).fillna(0)

res.loc[:, :] = df.stack().reset_index().pivot_table(index=0, columns="level_1", aggfunc="count").notna().astype(int).droplevel(0, axis=1)

Outputs:

>>> res

   col1  col2  col3  col4
A     1     0     0     1
B     1     1     1     0
C     0     1     1     0
D     1     0     1     1

answered Oct 5, 2020 at 21:00

Georgina Skibinski

13.5k2 gold badges16 silver badges44 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Quang Hoang · Accepted Answer · 2020-10-05 20:43:23Z

1

This is essentially crosstab:

df.melt().groupby('value')['variable'].value_counts().unstack(fill_value=0)

Output:

variable  col1  col2  col3  col4
value                           
A            1     0     0     1
B            1     1     1     0
C            0     1     1     0
D            1     0     1     1

answered Oct 5, 2020 at 20:43

Quang Hoang

151k11 gold badges64 silver badges86 bronze badges

Collectives™ on Stack Overflow

Create new Pandas boolean df based on values from list

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related