Python deepcopy() vs just initiating a numpy array in terms of run-time speed?

Question

I am curious if

elevation_arr = numpy.zeros([900, 1600], numpy.float32)
climate_arr = copy.deepcopy(elevation_arr)
rainfall_arr = copy.deepcopy(elevation_arr)

is faster or slower to execute than

elevation_arr = numpy.zeros([900, 1600], numpy.float32)
climate_arr = numpy.zeros([900, 1600], numpy.float32)
rainfall_arr = numpy.zeros([900, 1600], numpy.float32)

Any particular reason for using copy..deepcopy()? I think elevation_arr.copy() would anyways be equivalent to a deep copy, since the elements of elevation_arr are immutable ints. — fountainhead
– fountainhead, Commented Mar 23, 2019 at 22:58
@fountainhead I wanted to make sure the values wouldn't be references. I didn't know that immutable values would work like that. They are technically floats though aren't they? — LuminousNutria
– LuminousNutria, Commented Mar 23, 2019 at 23:03
True, they're float32s. But immutable, nevertheless, as are ints. My point is that, given an aggregate data structure (in this case numpy array) consisting of many objects, a shallow copy is good enough if the objects in the aggregation are all immutable. Only when the objects are mutable, it becomes important to make the copy a deep one, by making copies of the elements themselves. — fountainhead
– fountainhead, Commented Mar 23, 2019 at 23:21
For an array like yours: copy.copy(arr), copy.deepcopy(arr), arr.copy(), np.array(arr, copy=True) all time the same. deepcopy is only significantly different if the array has object dtype (same as if a list contains lists or dictionaries). — hpaulj
– hpaulj, Commented Mar 24, 2019 at 1:38

Sheldore · Accepted Answer · 2019-03-23 22:56:54Z

1

numpy_zeros performs slightly better for smaller arrays and much better for larger arrays as shown below

import copy
import numpy as np

def deep_copy():
    elevation_arr = np.zeros([900, 1600], np.float32)
    climate_arr = copy.deepcopy(elevation_arr)
    rainfall_arr = copy.deepcopy(elevation_arr)
    return 

def numpy_zeros():
    elevation_arr = np.zeros([900, 1600], np.float32)
    climate_arr = np.zeros([900, 1600], np.float32)
    rainfall_arr = np.zeros([900, 1600], np.float32)
    return

%timeit deep_copy()
# 4.13 ms ± 585 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

%timeit numpy_zeros()
# 3.01 ms ± 195 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

For a 10000 x 10000 array, following are the timings. numpy_zeros simply outperforms

%timeit deep_copy()
# 569 ms ± 50 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%timeit numpy_zeros()
# 15.6 µs ± 1.38 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

edited Mar 23, 2019 at 22:56

answered Mar 23, 2019 at 22:43

Sheldore

39.2k9 gold badges63 silver badges76 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

LuminousNutria Over a year ago

I didn't know you could time stuff like that, thank you.

Collectives™ on Stack Overflow

Python deepcopy() vs just initiating a numpy array in terms of run-time speed?

1 Answer 1

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related