Remove part of string after a specific character in Python

Question

Given a time column as follows:

             time
0   2019Y8m16d10h
1    2019Y9m3d10h
2  2019Y9m3d10h58s
3    2019Y9m3d10h

How can I remove substrings start by d, I have tried with df['time'].str.split('d')[0], but it doesn't work.

My desired result will like this. Thank you.

        time
0    2019Y8m16d
1    2019Y9m3d
2    2019Y9m3d
3    2019Y9m3d

jezrael · Accepted Answer · 2020-03-05 08:31:49Z

1

You are close, need str[0] for select lists and then add d:

df['time'] = df['time'].str.split('d').str[0].add('d')

Or:

df['time'] = df['time'].str.split('(d)').str[:2].str.join('')

print (df)
         time
0  2019Y8m16d
1   2019Y9m3d
2   2019Y9m3d
3   2019Y9m3d

df['time'] = df['time'].str.extract('(.+d)')
print (df)
         time
0  2019Y8m16d
1   2019Y9m3d
2   2019Y9m3d
3   2019Y9m3d

answered Mar 5, 2020 at 8:24

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Thank you, this works but d get missing as well and I want to keep it.

Valdi_Bo · Accepted Answer · 2020-03-05 08:30:20Z

1

One of possible solutions:

df['time'].str.extract(r'([^d]+d)')

answered Mar 5, 2020 at 8:30

Valdi_Bo

31.1k4 gold badges29 silver badges45 bronze badges

Raghul Raj · Accepted Answer · 2020-03-05 08:36:22Z

1

Or you can simply use apply functionality to solve the purpose as follows:

df.apply(lambda x: x['time'].split('d')[0]+'d',axis=1)

answered Mar 5, 2020 at 8:36

Raghul Raj

1,44811 silver badges24 bronze badges