For a massive performance boost, we can leverage NumPy strides here. The trick is to concatenate the original array with a slice of itself that ends at the second-to-last element, and then take sliding windows of the same length as the original array — each window is one rotation.
Hence, the implementation would be -
import numpy as np

def strided_method(ar):
    # Concatenate so that every rotation appears as a contiguous window
    a = np.concatenate((ar, ar[:-1]))
    L = len(ar)
    n = a.strides[0]  # step (in bytes) between consecutive elements
    return np.lib.stride_tricks.as_strided(a, (L, L), (n, n), writeable=False)
The output is a read-only view into the concatenated array, so no per-row data is copied and the runtime is essentially constant irrespective of the array size — a hugely efficient solution. If you need a writable output with its own memory space, make a copy, as shown in the timings later on.
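To make the read-only behaviour concrete, here's a small sketch (restating `strided_method` so it runs standalone) — writing into the view raises, while a copy is an ordinary writable array:

```python
import numpy as np

def strided_method(ar):
    # Concatenate so that every rotation appears as a contiguous window
    a = np.concatenate((ar, ar[:-1]))
    L = len(ar)
    n = a.strides[0]  # step (in bytes) between consecutive elements
    return np.lib.stride_tricks.as_strided(a, (L, L), (n, n), writeable=False)

rot = strided_method(np.array([1, 2, 3, 4]))
rot_copy = rot.copy()    # owns its memory, writable
rot_copy[0, 0] = 99      # fine

try:
    rot[0, 0] = 99       # ValueError: assignment destination is read-only
except ValueError:
    pass
```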
Sample run -
In [51]: foo = np.array([1,2,3,4])
In [52]: strided_method(foo)
Out[52]: 
array([[1, 2, 3, 4],
       [2, 3, 4, 1],
       [3, 4, 1, 2],
       [4, 1, 2, 3]])
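As a quick sanity check (not part of the original answer), the strided windows can be compared element-for-element against the loopy `np.roll` version — row `i` should equal the array rolled left by `i`:

```python
import numpy as np

foo = np.array([1, 2, 3, 4])

# Strided rotations: windows of length 4 over the concatenated array
a = np.concatenate((foo, foo[:-1]))
n = a.strides[0]
rolled = np.lib.stride_tricks.as_strided(a, (4, 4), (n, n), writeable=False)

# Loopy reference
loopy = np.array([np.roll(foo, -x) for x in range(4)])

assert np.array_equal(rolled, loopy)
```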
Runtime test -
In [53]: foo = np.random.randint(0,9,(1000))
# @cᴏʟᴅsᴘᴇᴇᴅ's loopy soln
In [54]: %timeit np.array([np.roll(foo, -x) for x in np.arange(foo.shape[0])])
100 loops, best of 3: 12.7 ms per loop
In [55]: %timeit strided_method(foo)
100000 loops, best of 3: 7.46 µs per loop
In [56]: %timeit strided_method(foo).copy()
1000 loops, best of 3: 454 µs per loop