Binary random array with a specific proportion of ones?

Question

What is the efficient(probably vectorized with Matlab terminology) way to generate random number of zeros and ones with a specific proportion? Specially with Numpy?

As my case is special for 1/3, my code is:

import numpy as np 
a=np.mod(np.multiply(np.random.randomintegers(0,2,size)),3)

But is there any built-in function that could handle this more effeciently at least for the situation of K/N where K and N are natural numbers?

Do you need the proportion to be exactly the given value, or is that just the expected proportion of the sample? — Warren Weckesser
– Warren Weckesser, Commented Oct 25, 2013 at 19:11
Also, what should happen for the 1/3 case when size is not divisible by 3? Exception? Round/floor/trunc? Weighted random round (so 10 has a 2/3 chance of 3 and a 1/3 chance of 4)? — abarnert
– abarnert, Commented Oct 25, 2013 at 19:15
@WarrenWeckesser, its the expected proportion in my case. I wished you didn't deleter your answer so I would have accepted it. — Cupitor
– Cupitor, Commented Oct 25, 2013 at 19:16
@Naji: I restored my answer. If you had needed the exact proportion, that method wouldn't work. — Warren Weckesser
– Warren Weckesser, Commented Oct 25, 2013 at 19:27
@Naji: Whatever you want? I wanted it to generate a trillion dollars, and all it gave me was an array. I suppose I'm not believing hard enough. ;) — abarnert
– abarnert, Commented Oct 25, 2013 at 20:15

Jaime · Accepted Answer · 2013-10-25 19:18:58Z

130

Yet another approach, using np.random.choice:

>>> np.random.choice([0, 1], size=(10,), p=[1./3, 2./3])
array([0, 1, 1, 1, 1, 0, 0, 0, 0, 0])

answered Oct 25, 2013 at 19:18

Jaime

67.7k19 gold badges128 silver badges164 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

abcd Over a year ago

note that this approach will not give you the exact proportion of zeros and ones you request . . . the answer by @mdml below will.

JFFIGK Over a year ago

true, and since it is accepted, I think Cupitor might have added a bug to his program

Warren Weckesser Over a year ago

@JFFIGK, dbliss: this was discussed in the comments to the question. Those comments are still there, so take a look.

Alireza Rezaee Over a year ago

Since the mentioned link is broken, see: numpy.random.choice.

mdml · Accepted Answer · 2013-10-25 19:09:51Z

52

A simple way to do this would be to first generate an ndarray with the proportion of zeros and ones you want:

>>> import numpy as np
>>> N = 100
>>> K = 30 # K zeros, N-K ones
>>> arr = np.array([0] * K + [1] * (N-K))
>>> arr
array([0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
       0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
       1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
       1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
       1, 1, 1, 1, 1, 1, 1, 1])

Then you can just shuffle the array, making the distribution random:

>>> np.random.shuffle(arr)
>>> arr
array([1, 1, 1, 0, 1, 0, 1, 1, 1, 1, 1, 1, 0, 0, 1, 1, 1, 0, 1, 0, 0, 1, 0,
       1, 1, 0, 0, 1, 0, 1, 1, 0, 1, 1, 1, 1, 0, 1, 0, 1, 0, 1, 1, 0, 1, 1,
       1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 1, 1, 0, 1, 1, 1, 1, 0, 1, 1, 1, 1, 1,
       0, 0, 0, 1, 1, 1, 0, 1, 1, 1, 1, 1, 1, 0, 1, 1, 1, 0, 1, 1, 1, 0, 1,
       1, 1, 1, 0, 1, 1, 1, 1])

Note that this approach will give you the exact proportion of zeros/ones you request, unlike say the binomial approach. If you don't need the exact proportion, then the binomial approach will work just fine.

answered Oct 25, 2013 at 19:09

mdml

23k8 gold badges61 silver badges66 bronze badges

2 Comments

Cupitor Over a year ago

How stupid of me! Right I forgot about binary distribution. Actually somebody posted binary right before you but he deleted his answer(dont know why!!)

mxmlnkn Over a year ago

This is quite clever

Abhijit · Accepted Answer · 2013-10-25 19:10:33Z

23

If I understand your problem correctly, you might get some help with numpy.random.shuffle

>>> def rand_bin_array(K, N):
    arr = np.zeros(N)
    arr[:K]  = 1
    np.random.shuffle(arr)
    return arr

>>> rand_bin_array(5,15)
array([ 0.,  1.,  0.,  1.,  1.,  1.,  0.,  0.,  0.,  1.,  0.,  0.,  0.,
        0.,  0.])

answered Oct 25, 2013 at 19:10

Abhijit

64k20 gold badges143 silver badges209 bronze badges

Comments

Warren Weckesser · Accepted Answer · 2013-10-25 19:07:30Z

21

You can use numpy.random.binomial. E.g. suppose frac is the proportion of ones:

In [50]: frac = 0.15

In [51]: sample = np.random.binomial(1, frac, size=10000)

In [52]: sample.sum()
Out[52]: 1567

answered Oct 25, 2013 at 19:07

Warren Weckesser

116k20 gold badges207 silver badges224 bronze badges

3 Comments

Epimetheus Over a year ago

This doesn't guarantee the correct proportion of ones like mdml's answer does.

Warren Weckesser Over a year ago

@John, this was discussed in the comments to the question. Take a look.

Epimetheus Over a year ago

I see now! Of course the question needs editing then as it asks for specific proportion.

joelostblom · Accepted Answer · 2019-03-27 16:14:02Z

2

Another way of getting the exact number of ones and zeroes is to sample indices without replacement using np.random.choice:

arr_len = 30
num_ones = 8

arr = np.zeros(arr_len, dtype=int)
idx = np.random.choice(range(arr_len), num_ones, replace=False)
arr[idx] = 1

Out:

arr

array([0, 0, 0, 1, 0, 0, 0, 1, 0, 1, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 1, 1,
       0, 0, 0, 0, 0, 1, 0, 0])

answered Mar 27, 2019 at 16:14

joelostblom

49.8k20 gold badges166 silver badges179 bronze badges

Comments

Galactic Ketchup · Accepted Answer · 2019-08-03 19:29:03Z

1

Simple one-liner: you can avoid using lists of integers and probability distributions, which are unintuitive and overkill for this problem in my opinion, by simply working with bools first and then casting to int if necessary (though leaving it as a bool array should work in most cases).

>>> import numpy as np
>>> np.random.random(9) < 1/3.
array([False,  True,  True,  True,  True, False, False, False, False])   
>>> (np.random.random(9) < 1/3.).astype(int)
array([0, 0, 0, 0, 0, 1, 0, 0, 1])

edited Aug 3, 2019 at 19:29

answered Aug 1, 2018 at 14:33

Galactic Ketchup

5736 silver badges15 bronze badges

2 Comments

Epimetheus Over a year ago

This doesn't guarantee the correct proportion of ones like mdml's answer does.

Galactic Ketchup Over a year ago

The OP said they wanted 1/3 to be the expected proportion of 1s, not the exact proportion.

Adi · Accepted Answer · 2023-11-26 00:39:53Z

0

You can generate a nd-array with random binary members (0 and 1) directly in one line through the following method. You can also use np.random.random() instead of np.random.uniform().

>>import numpy as np
>>np.array([[round(np.random.uniform()) for i in range(3)] for j in  range(3)])
array([[1, 0, 0],
       [1, 1, 1],
       [0, 1, 0]])
>>

edited Nov 26, 2023 at 0:39

Adi

5213 silver badges13 bronze badges

answered Nov 24, 2023 at 20:23

Ali Chitsaz

11 bronze badge

Collectives™ on Stack Overflow

Binary random array with a specific proportion of ones?

7 Answers 7

4 Comments

2 Comments

Comments

3 Comments

Comments

2 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

7 Answers 7

4 Comments

2 Comments

Comments

3 Comments

Comments

2 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related