1

I have an array of integers, and I want to find all the distinct elements in the array using C++.

Solution 1: brute force using nested loops; the complexity of this solution is O(n^2).

Solution 2: sorting; this takes O(n log n).

Is there any other technique that gives better results than O(n log n)? Any other data structure, or a different approach?

1
  • By distinct you mean, that have frequency 1? Commented Aug 22, 2015 at 4:57

3 Answers

4

Using a std::unordered_set will be O(n).


2 Comments

The average complexity for hash tables is O(n), not the worst-case. It is possible to do this in O(n) when the max. integer is known and small enough.
@Jens Hash table insertion is O(1) amortized, which is a stronger guarantee than merely average case. It is not possible to find "worst-case" inputs for which the average insert in a (long enough) sequence of inserts is worse than O(1).
0

You can also try std::nth_element: it partially sorts the range [first, last) around the n-th position, such that every element in [first, n-th) is <= the element at the n-th position, and every element in (n-th, last) is >= it. It has linear complexity, O(n), and finds the element that would sit at the n-th position in sorted order. It's not suited to finding a specific value, though, so maybe it's not exactly what you need. But it's worth keeping in mind :-)

Comments

-1

If you know the maximum integer value, and it is reasonably small, you can allocate a large vector and use it to count the frequency of each integer. Then iterate over the counts and collect all values with frequency one:

#include <iterator>
#include <vector>

template<typename I>
auto findWithFrequency(int f, int max, I first, I last)
{
    // max + 1 slots so the value max itself is a valid index.
    std::vector<int> counts(max + 1, 0);
    for (I it = first; it != last; ++it)
        counts[*it] += 1;

    // Collect the values (i.e. the indices) whose count equals f.
    std::vector<typename std::iterator_traits<I>::value_type> v;
    for (int i = 0; i <= max; ++i)
        if (counts[i] == f)
            v.push_back(i);

    return v;
}

In the worst case, this needs one pass over the input array and one pass over the counts array, so the complexity is O(n + max), which is O(n) when max is of the order of n or smaller.

This is essentially the idea behind counting sort, the simplest form of bucket sort, and the building block of radix sort.

3 Comments

Your technique does decrease the time below O(n log n). Maybe a hash would help.
@sam I don't understand your comment. The algorithm first iterates over the input array, which is O(n). It then iterates over the bucket array, which is also O(n). Where would a hash function help? The complexity for hash tables is O(n) on average, but bucket sort and radix sort have a worst-case complexity of O(n).
Would the downvoter comment why the -1? I think it is a reasonable approach for restricted integers.
