How to concatenate and output unicode text variables in Python

Question

My title terms might not be correct and may be a reason why I can't find this simple thing from websites.

I have a list of string variables. How do I actually concatenate them and output a real unicode sentence in Python?

base = ['280', '281', '282', '283']
end = ['0','1','2','3','4','5','6','7','8','9','a','b','c','d','e','f']
unicodes = [u''.join(['\u', j, i]) for j in base for i in end]

for u in unicodes:
    print u

I will get only strings like '\u280F' but not real character. But if I do:

print u'\u280F'

correct symbols shows up, which is: ⠏

And I'm sure there is a more elegant way to get a range of the symbols from u2800 to u283F...

falsetru · Accepted Answer · 2015-06-25 08:32:00Z

5

Conver the strings to integers (using int with base 16), the use unichr (chr if you're using Python 3.x) to convert the number into unicode object.

>>> int('280' + 'F', 16)  # => 0x280F, 16: hexadecimal
10255
>>> unichr(int('280' + 'F', 16))  # to unicode object
u'\u280f'
>>> print unichr(int('280' + 'F', 16))
⠏

base = ['280', '281', '282', '283']
end = ['0','1','2','3','4','5','6','7','8','9','a','b','c','d','e','f']
unicodes = [unichr(int(j + i, 16)) for j in base for i in end]

for u in unicodes:
    print u

answered Jun 25, 2015 at 8:32

falsetru

371k69 gold badges769 silver badges659 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

jfs · Accepted Answer · 2015-06-25 20:42:59Z

0

If you are stuck with unicodes input; you could use unicode-escape codecs, to get Unicode (b'\\u2800'.decode('unicode-escape') == u'\u2800'):

>>> for escaped in unicodes: print escaped.decode('unicode-escape')
...
⠽
⠾
⠿

Otherwise, generate range of integers directly:

for ordinal in range(0x2800, 0x283f + 1):
    print unichr(ordinal)

It produces the same output in this case.

answered Jun 25, 2015 at 20:42

jfs

417k210 gold badges1k silver badges1.7k bronze badges

Collectives™ on Stack Overflow

How to concatenate and output unicode text variables in Python

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related