How can I check if a URL is absolute using Python?

Question

What is the preferred solution for checking if an URL is relative or absolute?

Kukanani · Accepted Answer · 2017-03-02 21:34:47Z

73

Python 2

You can use the urlparse module to parse an URL and then you can check if it's relative or absolute by checking whether it has the host name set.

>>> import urlparse
>>> def is_absolute(url):
...     return bool(urlparse.urlparse(url).netloc)
... 
>>> is_absolute('http://www.example.com/some/path')
True
>>> is_absolute('//www.example.com/some/path')
True
>>> is_absolute('/some/path')
False

Python 3

urlparse has been moved to urllib.parse, so use the following:

from urllib.parse import urlparse

def is_absolute(url):
    return bool(urlparse(url).netloc)

edited Mar 2, 2017 at 21:34

Kukanani

7681 gold badge6 silver badges22 bronze badges

answered Dec 2, 2011 at 14:05

Lukáš Lalinský

41.5k6 gold badges109 silver badges128 bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

Geo Over a year ago

Shouldn't www.example.com/some/path count as abolute too?

Lukáš Lalinský Over a year ago

Officially, that's an relative URL with the whole string as path. If you want it to count as absolute, you would have to either add the http:// by some pre-processing or not use urlparse.

Nik Over a year ago

According to RFC //google.com is a protocol-relative url. And your code will return False for it.

Rockallite Over a year ago

I'd prefer urlsplit instead of urlparse. BTW, in Django you have a Python 2 & 3 compatible way: from django.utils.six.moves.urllib.parse import urlsplit, urlparse

Sean Over a year ago

@Nik not for me: In [27]: urlparse('//google.com') Out[27]: ParseResult(scheme='', netloc='google.com', path='', params='', query='', fragment='')

|

Bob Whitelock · Accepted Answer · 2021-03-16 07:43:57Z

32

If you want to know if an URL is absolute or relative in order to join it with a base URL, I usually do urllib.parse.urljoin anyway:

>>> from urllib.parse import urljoin
>>> urljoin('http://example.com/', 'http://example.com/picture.png')
'http://example.com/picture.png'
>>> urljoin('http://example1.com/', '/picture.png')
'http://example1.com/picture.png'
>>>

edited Mar 16, 2021 at 7:43

Bob Whitelock

1853 silver badges14 bronze badges

answered Dec 2, 2011 at 13:45

warvariuc

60.1k45 gold badges183 silver badges234 bronze badges

2 Comments

rescdsk Over a year ago

It turns out that this is what I wanted to do - it treats the first URL as the default for all unspecified parts of the second URL. If the second one is absolute, it just uses that one.

J. Taylor Over a year ago

Anyone using this should be aware that if given http://www.yahoo.com and www.google.com as inputs, this will give you http://www.yahoo.com/www.google.com as output, which probably isn't what you wanted. So you'll still have to check somehow if the second one is a url without a schema, or if actually a relative path.

Alexander Ovchinnikov · Accepted Answer · 2014-01-23 02:40:38Z

3

Can't comment accepted answer, so write this comment as new answer: IMO checking scheme in accepted answer ( bool(urlparse.urlparse(url).scheme) ) is not really good idea because of http://example.com/file.jpg, https://example.com/file.jpg and //example.com/file.jpg are absolute urls but in last case we get scheme = ''

I use this code:

is_absolute = True if '//' in my_url else False

answered Jan 23, 2014 at 2:40

Alexander Ovchinnikov

3891 gold badge5 silver badges10 bronze badges

1 Comment

guettli Over a year ago

AFAIK //foo/bar is a valid relative URL. With "relative" meaning "without scheme and netloc".

Belegnar · Accepted Answer · 2023-11-13 22:29:49Z

-2

pip install yarl

import yarl

if not yarl.URL(image).is_absolute():
    image = context["request"].build_absolute_uri(image)

because

yarl.URL("//google.com").is_absolute() is True
True

in the opposite to

urllib.parse.urlsplit("//google.com").scheme == ""
True

netloc is still defined though

urllib.parse.urlsplit("//google.com").netloc == "google.com"

Pros.

easier to read
easier to test (you can mock one particular method)

Cons.

extra deps (but pretty stable one)

edited Nov 13, 2023 at 22:29

answered Oct 28, 2023 at 8:48

Belegnar

9173 gold badges13 silver badges27 bronze badges

4 Comments

Chris Over a year ago

What is yarl? Please read How to Answer.

Belegnar Over a year ago

Python library. Well known

Chris Over a year ago

Don't assume that anything is "well known". The edited answer is better, but still not great: commands and code without any context rarely make a good answer. In this case you're asking folks to install some random package. What makes this better than the solution that uses the built-in urllib.parse.urljoin recommended in this answer? Again, please read How to Answer.

Belegnar Over a year ago

Using yarl is better because you move away from knowing the internal structure of the url. Easier to use and easier to test

Collectives™ on Stack Overflow

How can I check if a URL is absolute using Python?

4 Answers 4

Python 2

Python 3

6 Comments

2 Comments

1 Comment

4 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Python 2

Python 3

6 Comments

2 Comments

1 Comment

4 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related