Pandas Dataframe delete Row if string matches

Question

def clean_doc(df):
    for rownum in range(0, df.shape[0]):
        if "LM_" not in df.iloc[rownum][6]:
            clean_df = df.drop([df.index[rownum]])
    return clean_df

I want to delete every row whose value does not start with "LM_".

I also tried:

df.drop([rownum])

and many other variants, but each attempt deletes only one row of my dataset, when many more should be removed.

Dan · Accepted Answer · 2020-01-17 15:44:05Z

1

You could try:

df[df['<your_column>'].str.startswith('LM_')]

Example:

import pandas as pd

df = pd.DataFrame({'col':['abc', 'LM_abc']})

print(df[df['col'].str.startswith('LM_')])

Output:

      col
1  LM_abc

Your code deletes only one row because every iteration calls df.drop on the original df and overwrites clean_df. The returned frame is therefore the original minus only the last non-matching row.
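To connect this back to the function in the question, here is a minimal sketch of clean_doc rewritten around a single boolean mask instead of a loop. It assumes the column of interest sits at position 6, as in df.iloc[rownum][6]. The col_pos and prefix parameters and the sample data are hypothetical, added only for illustration.

```python
import pandas as pd


def clean_doc(df, col_pos=6, prefix="LM_"):
    """Return a copy of df keeping only rows whose value at col_pos starts with prefix."""
    # Select the column by position, mirroring df.iloc[rownum][6] in the question.
    # astype(str) is an assumption so that non-string values do not break .str.
    values = df.iloc[:, col_pos].astype(str)
    # Build one boolean mask for all rows instead of dropping inside a loop.
    mask = values.str.startswith(prefix)
    return df[mask].copy()


# Hypothetical data: seven columns so that position 6 exists.
df = pd.DataFrame(
    [
        [0, 0, 0, 0, 0, 0, "LM_a"],
        [1, 1, 1, 1, 1, 1, "abc"],
        [2, 2, 2, 2, 2, 2, "LM_b"],
        [3, 3, 3, 3, 3, 3, "xyz"],
    ],
    columns=list("abcdefg"),
)

clean_df = clean_doc(df)
print(clean_df["g"].tolist())  # ['LM_a', 'LM_b']
```

The original df is left untouched, so the function can be called repeatedly with different prefixes.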


2 Comments

Henry Yik Over a year ago

I think OP wants df[~df['<your_column>'].str.startswith('LM_')] instead.

Dan Over a year ago

@HenryYik I think it's the opposite: OP's code deletes the rows that do not contain LM_.
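The two readings discussed in these comments can be compared side by side. The sketch below uses a small hypothetical frame and also shows a substring match, since the loop in the question tests "LM_" not in ..., which checks containment rather than a prefix.

```python
import pandas as pd

# Hypothetical sample: one plain value, one prefixed value, one with LM_ in the middle.
df = pd.DataFrame({'col': ['abc', 'LM_abc', 'xLM_y']})

starts = df['col'].str.startswith('LM_')
contains = df['col'].str.contains('LM_', regex=False)

kept_prefix = df[starts]       # keep rows that start with LM_
dropped_prefix = df[~starts]   # the negated mask keeps everything else
kept_contains = df[contains]   # keep rows that contain LM_ anywhere

print(kept_prefix['col'].tolist())     # ['LM_abc']
print(dropped_prefix['col'].tolist())  # ['abc', 'xLM_y']
print(kept_contains['col'].tolist())   # ['LM_abc', 'xLM_y']
```

Which mask is right depends on whether the goal is to keep or to remove the LM_ rows, and on whether LM_ must appear at the start of the value.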
