comparing two Dataframe columns to check if they have same value in python

Question

I have two dataframes,

new1.
      Name       city
 0    sri won    chn
 1    pechi won  pune
 2    Ram won    mum
 0    pec won    kerala

new3
    req
0   pec
1   mut

I tried,

mask=new1.Name.str.contains("|".join(new3.req.values.tolist()))
new1[mask]

I am getting,

 new1[mask]
      Name       city
 1  pechi won    pune
 0  pec won      kerala

As "pechi" contains "pec", it took this valu. but I want the exact match between the values not "contains"

my desired output is,

 new1[mask]
      Name       city
 0  pec won      kerala

jezrael · Accepted Answer · 2017-08-09 06:37:26Z

1

You need \b that means "word boundary":

a = r'\b(' + "|".join(new3.req.values.tolist()) + r')\b'
print (a)
\b(pec|mut)\b

mask=new1.Name.str.contains(a)
df = new1[mask]
print (df)
      Name    city
0  pec won  kerala

answered Aug 9, 2017 at 6:37

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Pyd Over a year ago

wow!, It worked perfectly, could you please explain what the first line of code does ???

jezrael Over a year ago

You can check this for explain word boundary (my English is horrible, especially for deep explanations)

Zero · Accepted Answer · 2017-08-09 07:19:42Z

0

You need space in separator

In [1350]: new1
Out[1350]:
        Name    city
0    sri won     chn
1  pechi won    pune
2    Ram won     mum
0    pec won  kerala

In [1351]: new3
Out[1351]:
   req
0  pec
1  mut

In [1352]: ' | '.join(new3.req)
Out[1352]: 'pec | mut'

In [1353]: new1.Name.str.contains(' | '.join(new3.req))
Out[1353]:
0    False
1    False
2    False
0     True
Name: Name, dtype: bool

In [1354]: new1[new1.Name.str.contains(' | '.join(new3.req))]
Out[1354]:
      Name    city
0  pec won  kerala

edited Aug 9, 2017 at 7:19

answered Aug 9, 2017 at 6:32

Zero

77.4k22 gold badges153 silver badges153 bronze badges

2 Comments

Pyd Over a year ago

It gives the same result :(

Zero Over a year ago

You can check the example flow now.

Collectives™ on Stack Overflow

comparing two Dataframe columns to check if they have same value in python

2 Answers 2

2 Comments

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related