0

Here is code which fills a two-dimensional array with randomly generated numbers in the range [1, 19] without duplicates. My question is: how do I determine its complexity?

For example, I see that its running time is at least O(n^2) because of the inner and outer loops, but what about the goto statement?

Here is my code:

#include <iostream>
#include <set>
#include <cstdlib>
using namespace std;

int main()
{
    int min = 1;
    int max = 19;
    int a[3][3];
    set<int> b;

    for (int i = 0; i < 3; i++)
    {
        for (int j = 0; j < 3; j++)
        {
loop:
            int m = min + rand() % (max - min + 1);  // +1 so max itself can be drawn

            if (b.find(m) == b.end())
            {
                a[i][j] = m;
                b.insert(m);
            }
            else
                goto loop;
        }
    }
        }
    }

    for (int i=0; i<3; i++)
    {
        for (int j=0; j<3; j++)
            cout<< a[i][j]<<"  ";
        cout<<endl;
    }
    return 0;
}

I would say that the complexity of the algorithm is c*O(n^2), where c is some constant, because when it finds a duplicated element inside the loops it repeats the random-number generation, which takes some constant time. Am I right?

13
  • 2
    Also, please fix the indenting in your code. It's very hard to read. Commented Sep 15, 2011 at 11:20
  • 4
    It's a goto statement. RUN! Commented Sep 15, 2011 at 11:23
  • 2
    It makes no sense to say c*O(something), since the O(something) already means that some multiplicative constant is involved. Commented Sep 15, 2011 at 11:26
  • 2
    "For example, I see that its running time is at least O(n^2) because of the inner and outer loops" — don't assume that two nested loops imply O(N^2). Commented Sep 15, 2011 at 11:26
  • 5
    To be clear: it's currently impossible to say what the complexity is, because you haven't made it clear what n corresponds to. Commented Sep 15, 2011 at 11:30

4 Answers

3

As the likelihood of drawing an unused number decreases, the number of goto retries increases. For a uniform random number generator, the expected number of retries grows linearly with the count of numbers already drawn. It definitely does not just add a constant to your complexity.

If n is the number of elements in a, then on average it scales as O(n²) (or, if n is the number of rows of the square matrix a, O(n⁴)).

A much simpler implementation would use a Fisher–Yates shuffle.


1 Comment

This critically depends on max versus N. If max grows as O(N*N), then the average number of goto loops doesn't grow. And obviously, if max doesn't grow at all, then the algorithm breaks when N*N > max. You can't fill a 5x5 matrix with only 20 unique numbers.
2

It's O(infinity). The O notation gives an upper bound. Because of your use of rand() in a loop, there is no guarantee that you make progress, therefore no finite upper bound exists.

[edit] Ok, people also want other complexities than the conventional, worst-case complexity.

The worst-case complexity is obtained by assuming that the RNG generates an infinite series of ones; this means that even the first loop iteration doesn't finish. Therefore there's no finite upper bound on the run time, O(infinity).

The best-case complexity is obtained by assuming that the RNG generates sequential numbers. That means the cost of each iteration is O(log N) (set::find), and there are O(N)*O(N) iterations, so the upper bound is O(N² log N).

The average-case complexity is harder. Assuming that max = k*N*N for some k > 1, the RNG will successfully pick an "unused" number in O(1) time. Even after all N*N numbers are chosen, there are still (k-1)*N*N unused numbers, so the chance p of picking an unused number satisfies p >= ((k-1)*N*N)/(k*N*N) = (k-1)/k. That means we can expect to pick an unused number in k/(k-1) attempts, which is independent of N and therefore O(1). set::find still dominates the cost of each iteration, at O(log N). We still have the same number of iterations, so we get the same upper bound of O(N² log N).

10 Comments

Big-O does not mean "worst-case". It is, however, fair to say "the worst-case complexity is O(infinity)".
It does mean upper bound, see en.wikipedia.org/wiki/… . Lower bound is Ω(f(N)), combined bound is Θ(f(N))
Damn, you were faster than me. +1 anyway.
There's a difference between "worst-case" and "upper bound". You can have a lower bound on the worst case or an upper bound on the best case etc.
Not going to downvote, but all you have to do is assume that the random number generator is uniform to guarantee convergence in probability. Furthermore, you can work out the expected complexity. So I doubt that it's O(infinity)
1

The goto loops until the random number is one that hasn't been used yet. If the distribution of random numbers is uniform, this "retry ... until" is on average linear with respect to the amplitude of the range. But this factor multiplies the complexity of set::find (O(log n)); set::insert only happens once per element.

The two outer for loops have constant bounds (their trip counts don't depend on the data), hence they just multiply the running time by a constant and don't increase the complexity.

Comments

1

"Complexity" is not about how much absolute time (or space) your program takes. It is about how much the time (or space) increases when you increase the size of your program's input data.

(BTW O for time and O for space may be different.)

Time Complexity

Assuming n is number of elements in the matrix, you have to ask yourself what happens when you add a single element to your matrix (i.e. when n becomes n+1):

  • You need to iterate over the new element, which is O(1). We are talking about one iteration here, so double loop does not matter.
  • You have another iteration for printing, which is also O(1), assuming cout<< is O(1).
  • You have to find the element which is O(log(n)) - the std::set is typically implemented as a red-black tree.
  • You have to retry the find (via goto) potentially several times. Depending on rand(), min, max and the width of int, the number of retries may be O(1) (i.e. it does not increase with the number of elements) or it may be worse than that.
  • You have to insert the element which is O(log(n)).

Assuming the "best" rand(), you are looking at the following increase for one element...

(O(1) + O(1)) * (O(log(n)) * O(1) + O(log(n))) = O(1) * O(log(n)) = O(log(n))

...so for n elements, your complexity is:

(O(n) + O(n)) * (O(log(n)) * O(1) + O(log(n))) = O(n) * O(log(n)) = O(n * log(n))

Assuming a "bad" rand() with O(n) retries, you are looking at...

(O(n) + O(n)) * (O(log(n)) * O(n) + O(log(n))) = O(n) * O(n * log(n)) = O(n^2 * log(n))

Space Complexity

Your matrix is O(n) and std::set is O(n) so you are O(n) here overall.

Comments
