Some problems with numpy array

Question

li = np.array(list("123"))
li[0] = "fff"
print(li)

prints ['f' '2' '3'] and not ['fff' '2' '3']. Why so? How can I tune/fix it?

Hint: what is the dtype of li?

John Coleman
– John Coleman

2021-06-12 17:48:46 +00:00
Commented Jun 12, 2021 at 17:48 — John Coleman
– John Coleman, Commented Jun 12, 2021 at 17:48

L.Grozinger · Accepted Answer · 2021-06-12 17:57:14Z

Why so?

Some debugging will help here. Take the first line for example:

>>> li = np.array(list("123"))

Then what is li?

>>> li
array(['1', '2', '3'], dtype='<U1')

Note the datatype of li, <U1, the docs explain that this datatype is for a unicode string of length 1. numpy has chosen this type for you, based on the elements you provided in the constructor, in order that it can allocate memory for li and probably do some optimisation in the background.

What happens when you do li[0] = "fff" then, since "fff" is not a unicode string of length 1? Then, numpy will convert "fff" to a unicode string of length 1, which in this case, appears to be "f".

>>> li[0] = "fff"
>>> li
array(['f', '2', '3'], dtype='<U1')

Now the print output should make sense.

What to do about it

You can explicitly pass a datatype when you construct your array. E.g.

>>> li = np.array(list("123"), dtype="<U3")
>>> li
array(['1', '2', '3'], dtype='<U3')
>>> li[0] = "fff"
>>> li
array(['fff', '2', '3'], dtype='<U3')

You will of course then have to decide in advance which type the array really will be, in this case, the maximum length of the string you can have is 3.

If you can make no guarantees about the types in the array, consider instead using a list instead of a numpy.array as a data structure.

Collectives™ on Stack Overflow

Some problems with numpy array

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related