casting array to variable

Question

I need efficient way to cast part of array to variable. Let's suppose array is defined as this:

unsigned char bytes[240];

now, I need to get uint32_t value from somewhere in the array, something like this:

uint32_t * word = reinterpret_cast<uint32_t *>(bytes[4]);

Which I think will get me second word in the array right? My question is, is this safe and portable (windows, linux on x86 & x86_64, os x would be nice, don't care about arm, ia-64 etc).

should be uint32_t * word = reinterpret_cast<uint32_t *>(&bytes[4]); otherwise you're casting the value of bytes[4] into a pointer. — jsantander
– jsantander, Commented Apr 22, 2014 at 11:37
The problem is, it's not portable across different machine architectures or data coming from network connections (also see Endianess) — πάντα ῥεῖ
– πάντα ῥεῖ, Commented Apr 22, 2014 at 11:38
@πάνταῥεῖ what you said is correct, but the OP only seem to care about intel machines. — jsantander
– jsantander, Commented Apr 22, 2014 at 11:42

Mankarse · Accepted Answer · 2014-04-22 12:45:03Z

4

You should use memcpy. This portably ensures that there are no alignment or strict aliasing problems. If no copy is needed, compilers are often smart enough to figure this out and directly reference the data in the array:

uint32_t value;
memcpy(&value, &bytes[4], sizeof value);
//Modify value:
//...
//Copy back to array:
memcpy(&bytes[4], &value, sizeof value);

edited Apr 22, 2014 at 12:45

answered Apr 22, 2014 at 12:22

Mankarse

40.9k12 gold badges105 silver badges146 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Z boson Over a year ago

How efficient is memcpy for small sizes? I have only used it for copying large blocks but maybe I should be using it for small blocks as well.

cmaster - reinstate monica · Accepted Answer · 2014-04-22 13:06:00Z

2

What you do does not violate strict aliasing rules because you cast to/from a char type pointer. In the standard, pointers to char types are the only exception from the strict aliasing rules.

As others have pointed out, you can run into the problem of alignment when you cast a char* to a larger type. You can either work around this by doing the alignment yourself, or just use memcpy() as Mankarse suggests.

But even the memcpy() approach is subject to byte order problems: If you've written your program on a little endian machine (x86 for example), it will likely crash on a big endian machine (ARM for example), and vice versa.

So, if you want to write portable code, you need to use a byte order that you specify. You can easily do so using the bit shift operators:

int32_t read_word_le(signed char* bytes) {
    return (int32_t)bytes[0] +
        ((int32_t)bytes[1] << 8) +
        ((int32_t)bytes[2] << 16) +
        ((int32_t)bytes[3] << 24);
}

int32_t read_word_be(signed char* bytes) {
    return (int32_t)bytes[3] +
        ((int32_t)bytes[2] << 8) +
        ((int32_t)bytes[1] << 16) +
        ((int32_t)bytes[0] << 24);
}

edited Apr 22, 2014 at 13:06

answered Apr 22, 2014 at 12:58

cmaster - reinstate monica

41.1k9 gold badges69 silver badges110 bronze badges

1 Comment

Mankarse Over a year ago

Good point about byte order problems. My answer is only valid if the data in the array was originally put there by the same program (on the same platform with the same compiler). This is often a valid assumption, but it is obviously false if the data is being moved around a network or otherwise persisted between program runs.

Joky · Accepted Answer · 2014-04-22 22:41:20Z

0

I would avoid indexing on the char if I know what is in the buffer. If it is indeed an array of int, cast first and index after for clarity. If you want the second 32 bits integer in the array:

uint32_t * words = reinterpret_cast(bytes); uint32_t second = words[1];

It is hard to answer about portability as you don't provide much information on the use case. As long as the data in the bytes buffer is produced and used on the same machine, the code is portable (and would be using simply int). Things become messy when you exchange data produced on a different architecture.

answered Apr 22, 2014 at 22:41

Joky

1,63811 silver badges16 bronze badges

Collectives™ on Stack Overflow

casting array to variable

3 Answers 3

1 Comment

1 Comment

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related