modify strings in pandas dataframe column

Question

I want to make all strings lower case and remove the whitespaces at the beginning and end of the strings.

df = pandas.DataFrame(data=[1,2,3,'A'],columns=['A'])
df['A'] = numpy.where(
    df['A'].apply(lambda x: isinstance(x, str)),
    df['A'].str.lower().str.strip(),
    df['A'],
)

The problem is that the code above fails if none of the rows is a string.

df = pandas.DataFrame(data=[1,2,3],columns=['A'])
AttributeError: Can only use .str accessor with string values, which use np.object_ dtype in pandas

Is there a better way to do this than

for index in df['A'].index:
    if isinstance(df['A'].iloc[index], str):
        df['A'].iloc[index] = df['A'].iloc[index].str.lower().str.strip()

Use df['A']=df['A'].apply(lambda x: x.lower().strip() if isinstance(x, str) else x) — Tarifazo
– Tarifazo, Commented Feb 21, 2019 at 13:57

Tarifazo · Accepted Answer · 2019-02-21 14:07:45Z

5

Assuming you want to leave your non strings untouched, you can use:

df['A']=df['A'].apply(lambda x: x.lower().strip() if isinstance(x, str) else x)

answered Feb 21, 2019 at 14:07

Tarifazo

4,3631 gold badge11 silver badges24 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

modify strings in pandas dataframe column

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related