Is there any way in Python to transform this %CE%B1%CE%BB%20 into this αλ which is its real representation?
3 Answers
For python 2:
>>> import urllib2
>>> print urllib2.unquote("%CE%B1%CE%BB%20")
αλ
For python 3:
>>> from urllib.parse import unquote
>>> print(unquote("%CE%B1%CE%BB%20"))
αλ
And here's code that works in all versions:
try:
from urllib import unquote
except ImportError:
from urllib.parse import unquote
print(unquote("%CE%B1%CE%BB%20"))
2 Comments
user136036
For Python 3:
import urllib.request urllib.request.unquote(...)rwreed
urllib has now put unqote in the parse module
from urllib.parse import unquote (git.ziirish.me/ziirish/burp-ui/issues/75)There are two encodings in play here. Your string has first been encoded as UTF-8, then each byte has been percent-encoded.
To get the original string back you need to first unquote it, and then decode it:
>>> import urllib
>>> s = '%CE%B1%CE%BB%20'
>>> result = urllib.unquote(s).decode('utf8')
>>> print result
αλ
Note that you need a Unicode enabled console in order to display the value (if you get an error with the print statement, try running it in IDLE).
2 Comments
hytromo
thanks a lot! decode('utf8') was useless to me, though. unquote(s) did the job!
Lord_Gestalter
But it does. Just tried it to be on the safe side (IDLE, py2.7)