Convert ASCII string encoded array into string?

Question

Input:

'0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'

Output:

Hello!

Current Solution:

    s = []
    birary_data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'.replace(' ', '').split('0x')
    for c in birary_data:
        if len(c) > 1:
            s.append(bytes.fromhex(c).decode('utf-8', 'ignore'))
    print("".join(s))

Need help with:

Could anyone suggest a more elegant solution, please?

@MauriceMeyer, it feels like I'm doing extra and unnecessary steps. However, there could a very simple solution, like birary_data.decode('hex') — Gооd_Mаn
– Gооd_Mаn, Commented Mar 5, 2020 at 11:41

Shubham Sharma · Accepted Answer · 2020-03-05 11:50:19Z

4

Try this:

data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'
string = "".join([chr(int(item, 16)) for item in data.split()])
print(string)

Output:

Hello!

answered Mar 5, 2020 at 11:50

Shubham Sharma

71.8k6 gold badges26 silver badges58 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Nullman Over a year ago

I always forget that int can take a base, this is a good solution

Jon Betts Over a year ago

Doesn't this have a leading null byte and something else before the last char?

Guy · Accepted Answer · 2020-03-05 12:08:51Z

4

Another option is to remove 3 characters (or less) length substrings, 0x and white spaces. bytes.fromhex can handle a string like '48656c6c6f8E21'

binary_data = '0X0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'
binary_data = re.sub(r'\b\w{3}\b|\s?0x', '', binary_data)
print(bytes.fromhex(binary_data).decode('utf-8', 'ignore'))

edited Mar 5, 2020 at 12:08

answered Mar 5, 2020 at 12:01

Guy

51.2k10 gold badges49 silver badges96 bronze badges

1 Comment

Nullman Over a year ago

why not change your pattern to r'\b\w{3}\b|\s?0x' to get '48656c6c6f8E21' directly?

Akhil Pathania · Accepted Answer · 2020-03-05 11:55:09Z

3

You can use the below one Here in the code i am first splitting the hex according to white-space and then iterating and joining the character i get.

a = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'
print(''.join(chr(int(i, 16)) for i in a.split()))

answered Mar 5, 2020 at 11:55

Akhil Pathania

7522 gold badges6 silver badges20 bronze badges

Comments

Jon Betts · Accepted Answer · 2020-03-05 12:06:21Z

3

The builtin bytes.fromhex() is very nearly all we need. There are however two problems we need to get around:

The null byte at the front
The invalid char in position 6 (0x8E)

import re

data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'

string = bytes.fromhex(re.sub('0x(0 )?', '', data)).decode('utf-8', 'ignore')

The regex will take care of both stripping the null byte and formatting the string correctly for bytes.fromhex(). The ignore in the decode will skip the bad byte.

answered Mar 5, 2020 at 12:06

Jon Betts

3,3981 gold badge20 silver badges15 bronze badges

Comments

Joan Lara · Accepted Answer · 2020-03-05 12:02:40Z

2

birary_data = '0x0 0x48 0x65 0x6c 0x6c 0x6f 0x8E 0x21'.replace('0x', '').split()
print(bytearray.fromhex(''.join(c for c in birary_data if len(c) > 1)).decode('utf-8', 'ignore'))

Output:

Hello!

edited Mar 5, 2020 at 12:02

answered Mar 5, 2020 at 11:36

Joan Lara

1,3978 silver badges15 bronze badges

Collectives™ on Stack Overflow

Convert ASCII string encoded array into string?

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related