Poor CPU usage in a parallel loop

In the code below, I have a parallel section where each thread uses a private vector<int> to push and pop integers. The problem I have is that as I increase the number of threads, the performance of each core decreases drastically, and the Kernel CPU usage (red bar in the htop command) assigned to each core increases a lot. For example, with 1 core I have 100% normal CPU usage, but with 25 cores (see image) almost all the CPU usage goes to the kernel.

It is probably something very basic, but I just don't know why it happens. I would expect that since each thread has its own private variable each core would work exactly the same no matter the number of total CPUS used in the parallel section.

Any advise?

int cpus = 25;
#pragma omp parallel for schedule(dynamic,1)
for (int ss = 0; ss < cpus; ss++)
{
    std::vector<int> q;
    while (true)
    {
        q.push_back(rand());
        q.pop_back();
    }
}

asked Aug 19, 2020 at 7:19

SandiaDeDia

2912 silver badges11 bronze badges

3

stackoverflow.com/questions/6161322/…

Mat
– Mat

2020-08-19 07:24:43 +00:00
Commented Aug 19, 2020 at 7:24
6

en.cppreference.com/w/cpp/numeric/random/rand: "It is implementation-defined whether rand() is thread-safe." - using stuff from <random> is probably better.

Mat
– Mat

2020-08-19 07:26:16 +00:00
Commented Aug 19, 2020 at 7:26
Thank you all!!!! yes, I just tried removing the random and it is working.

SandiaDeDia
– SandiaDeDia

2020-08-19 07:28:49 +00:00
Commented Aug 19, 2020 at 7:28
2

rand can either be not thread-safe, which makes your program undefined, or thread-safe, which makes your program non-parallel. (I suspect that the program spends all that kernel time waiting for it.)

molbdnilo
– molbdnilo

2020-08-19 07:29:12 +00:00
Commented Aug 19, 2020 at 7:29
The GLIBC implementation of rand() is not thread-safe.

Hristo Iliev
– Hristo Iliev

2020-08-19 08:00:41 +00:00
Commented Aug 19, 2020 at 8:00

| Show 3 more comments

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Poor CPU usage in a parallel loop [duplicate]

0

Linked

Hot Network Questions