Python - How to cut a string in Python?

Question

Suppose that I have the following string:

http://www.domain.com/?s=some&two=20

How can I remove the & and everything after it, so that I end up with this string:

http://www.domain.com/?s=some

Claudiu · Accepted Answer · 2011-11-23 19:30:35Z

Well, to answer the immediate question:

>>> s = "http://www.domain.com/?s=some&two=20"

The rfind method returns the index of the right-most occurrence of a substring:

>>> s.rfind("&")
29

You can take all elements up to a given index with the slicing operator:

>>> "foobar"[:4]
'foob'

Putting the two together:

>>> s[:s.rfind("&")]
'http://www.domain.com/?s=some'

If you are dealing with URLs in particular, you might want to use built-in libraries that deal with URLs. If, for example, you wanted to remove two from the above query string:

First, parse the URL as a whole:

>>> import urlparse, urllib
>>> parse_result = urlparse.urlsplit("http://www.domain.com/?s=some&two=20")
>>> parse_result
SplitResult(scheme='http', netloc='www.domain.com', path='/', query='s=some&two=20', fragment='')

Take out just the query string:

>>> query_s = parse_result.query
>>> query_s
's=some&two=20'

Turn it into a dict:

>>> query_d = urlparse.parse_qs(parse_result.query)
>>> query_d
{'s': ['some'], 'two': ['20']}
>>> query_d['s']
['some']
>>> query_d['two']
['20']

Remove the 'two' key from the dict:

>>> del query_d['two']
>>> query_d
{'s': ['some']}

Put it back into a query string:

>>> new_query_s = urllib.urlencode(query_d, True)
>>> new_query_s
's=some'

And now stitch the URL back together:

>>> result = urlparse.urlunsplit((
    parse_result.scheme, parse_result.netloc,
    parse_result.path, new_query_s, parse_result.fragment))
>>> result
'http://www.domain.com/?s=some'

The benefit of this is that you have more control over the URL. For example, if you always wanted to remove the two argument, even when it appears earlier in the query string ("two=20&s=some"), this would still do the right thing. It might be overkill depending on what you want to do.

Good advice. On Python 3 it is named urllib.parse: docs.python.org/3/library/urllib.parse.html
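
As a rough Python 3 counterpart to the session above, the same steps might look like the sketch below. The helper name remove_query_param is made up for illustration and is not part of the original answer; the functions it calls (urlsplit, parse_qs, urlencode, urlunsplit) all live in urllib.parse on Python 3.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qs, urlencode


def remove_query_param(url, name):
    """Return url with the query parameter `name` removed (hypothetical helper)."""
    parts = urlsplit(url)
    # parse_qs maps each key to a list of values, e.g. {'s': ['some'], 'two': ['20']}
    query = parse_qs(parts.query, keep_blank_values=True)
    query.pop(name, None)  # no error if the parameter is absent
    # doseq=True expands the value lists back into key=value pairs
    new_query = urlencode(query, doseq=True)
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, new_query, parts.fragment)
    )


print(remove_query_param("http://www.domain.com/?s=some&two=20", "two"))
# http://www.domain.com/?s=some
print(remove_query_param("http://www.domain.com/?two=20&s=some", "two"))
# http://www.domain.com/?s=some
```

Because the parameter is removed by name rather than by position, the result is the same whichever order the arguments appear in.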

César · Accepted Answer · 2011-11-23 19:16:36Z

52

You need to split the string:

>>> s = 'http://www.domain.com/?s=some&two=20'
>>> s.split('&')
['http://www.domain.com/?s=some', 'two=20']

As you can see, that returns a list, so you can take its first element:

>>> s2 = s.split('&')[0]
>>> print(s2)
http://www.domain.com/?s=some

jonathan.hepp · Accepted Answer · 2011-11-23 19:15:03Z

8

string = 'http://www.domain.com/?s=some&two=20'
cut_string = string.split('&')
new_string = cut_string[0]
print(new_string)

3 Comments

Claudiu Over a year ago

This won't work if there are any other ampersands in the URL.

jonathan.hepp Over a year ago

I answered the question like everyone, he didn't ask for an exception. So to do the example he gave, this is the most simple pythonic way to do it.

Claudiu Over a year ago

Ah, that's true. I thought he asked how to cut at the right-most ampersand in a string (so I was thinking it would be better if you gave string.rsplit('&', 1)), but he didn't necessarily ask that.
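
To make the difference discussed in these comments concrete, here is a small sketch. The URL with a third parameter is an assumption added for illustration; it is not from the question.

```python
# Hypothetical URL with two ampersands, to show where each call cuts.
url = "http://www.domain.com/?s=some&two=20&three=30"

first_cut = url.split("&", 1)[0]   # everything before the first &
last_cut = url.rsplit("&", 1)[0]   # everything before the last &

print(first_cut)  # http://www.domain.com/?s=some
print(last_cut)   # http://www.domain.com/?s=some&two=20
```

With a single ampersand, as in the question, both calls give the same result.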

David Heffernan · Accepted Answer · 2011-11-23 19:34:35Z

5

You can use find()

>>> s = 'http://www.domain.com/?s=some&two=20'
>>> s[:s.find('&')]
'http://www.domain.com/?s=some'

Of course, if there is a chance that the searched for text will not be present then you need to write more lengthy code:

pos = s.find('&')
if pos != -1:
    s = s[:pos]

Whilst you can make some progress using code like this, more complex situations demand a true URL parser.
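
As a possible alternative to the explicit -1 check, str.partition handles the missing-separator case by itself: it returns the whole string as the first element when the separator is not found. A minimal sketch, where the helper name cut_before is made up for illustration:

```python
def cut_before(text, sep="&"):
    """Return the part of text before the first sep, or all of text if sep is absent."""
    head, _, _ = text.partition(sep)
    return head


print(cut_before("http://www.domain.com/?s=some&two=20"))
# http://www.domain.com/?s=some
print(cut_before("http://www.domain.com/?s=some"))
# http://www.domain.com/?s=some
```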

edited Nov 23, 2011 at 19:34

answered Nov 23, 2011 at 19:15

Ben · Accepted Answer · 2011-11-23 19:17:13Z

2

>>> s = "http://www.domain.com/?s=some&two=20"
>>> s.split("&")
['http://www.domain.com/?s=some', 'two=20']


bigblind · Accepted Answer · 2011-11-23 19:19:57Z

1

s[0:s.index("&")]

What this does:

It takes a slice of the string starting at index 0, up to but not including the index of the first & in the string. Note that index raises a ValueError if the string contains no &.


charly_0x13 · Accepted Answer · 2025-10-22 10:20:24Z

0

You can also do this cleanly with regular expressions if you need more flexibility in the separator:

import re

s = "http://www.example.com/?s=some&two=20"
result = re.split(r'&.*', s)[0]
print(result)
# http://www.example.com/?s=some

The pattern &.* matches the first & and everything after it, so re.split(r'&.*', s) returns the text before the & as its first element.
It is particularly useful if your separator is more complex than a single character.
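
As an illustration of a "more complex" separator, the sketch below cuts at whichever of & or ; comes first. The URL that uses ; between parameters is an assumption made up for this example.

```python
import re

# Hypothetical URL that mixes ';' and '&' as parameter separators.
s = "http://www.example.com/?s=some;two=20&three=30"

# maxsplit=1 stops after the first separator, so the rest of the string is not scanned further.
head = re.split(r"[&;]", s, maxsplit=1)[0]
print(head)
# http://www.example.com/?s=some
```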
