How can I select unique values from several columns in Oracle SQL?

Question

Basically, I've got the following table:

ID | Amount
AA | 10
AA | 20
BB | 30
BB | 40
CC | 10
CC | 50
DD | 20
DD | 60
EE | 30
EE | 70

I need each column to contain only unique entries, as in the following example:

ID | Amount
AA | 10
BB | 30
CC | 50
DD | 60
EE | 70

So far, the following snippet gives almost what I want, but first_value() may return a value that isn't unique in the current column:

first_value(Amount) over (partition by ID)

DISTINCT isn't helpful either, as it returns unique rows, not unique values within each column.

EDIT: Selection order doesn't matter

@jarlh And how would he display the Amount, based on the grouped ID values?
– Radu Gheorghiu, commented Jan 19, 2016 at 10:48
@RaduGheorghiu, good question, I can't see any pattern in how the selected values are chosen. Up to OP. (Or do you have any idea?)
– jarlh, commented Jan 19, 2016 at 10:59
Each column must be unique on its own, so either AA 10 or AA 20 could be chosen, as long as AA hasn't already been chosen and the amount (10 or 20) hasn't already been chosen. It's stated that selection order does not matter. (Edit: sorry, I was late hitting enter.)
– Paul Maxwell, commented Jan 19, 2016 at 12:11

user330315 · Accepted Answer · 2016-01-19 20:05:29Z

2

This works for me, even with the problematic combinations mentioned by Dmitriy. I don't know how fast it is for larger volumes, though.

with ids as (
  select id, row_number() over (order by id) as rn
  from data
  group by id
), amounts as (
  select amount, row_number() over (order by amount) as rn
  from data
  group by amount
)
select i.id, a.amount
from ids i
  join amounts a on i.rn = a.rn;

SQLFiddle currently doesn't work for me, here is my test script:

create table data (id varchar(10), amount integer);

insert into data values ('AA',10);
insert into data values ('AA',20);
insert into data values ('BB',30);
insert into data values ('BB',40);
insert into data values ('CC',10);
insert into data values ('CC',50);
insert into data values ('DD',20);
insert into data values ('DD',60);
insert into data values ('EE',30);
insert into data values ('EE',70);

Output:

id | amount
---+-------
AA |     10
BB |     20
CC |     30
DD |     40
EE |     50
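To see what this pairing does to the original rows, here is a minimal sketch that runs the query above against the test script's data. It uses Python's built-in sqlite3 module rather than Oracle (an assumption made only so the check is easy to run; window functions need SQLite 3.25 or newer), and the helper name paired_rows is made up for illustration. It lists the output pairs that never occur together in the input.

```python
import sqlite3

# Sample rows copied from the test script in the answer.
ROWS = [("AA", 10), ("AA", 20), ("BB", 30), ("BB", 40), ("CC", 10),
        ("CC", 50), ("DD", 20), ("DD", 60), ("EE", 30), ("EE", 70)]

# The answer's query, with an ORDER BY added so the result order is stable.
QUERY = """
with ids as (
  select id, row_number() over (order by id) as rn
  from data
  group by id
), amounts as (
  select amount, row_number() over (order by amount) as rn
  from data
  group by amount
)
select i.id, a.amount
from ids i
  join amounts a on i.rn = a.rn
order by i.id
"""


def paired_rows():
    """Run the query on an in-memory SQLite database (stand-in for Oracle)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table data (id varchar(10), amount integer)")
    conn.executemany("insert into data values (?, ?)", ROWS)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


result = paired_rows()
# Pairs in the output that do not exist as a row in the input.
invented = [row for row in result if row not in ROWS]
print("result:  ", result)
print("invented:", invented)
```

Both columns come out unique, but only because the i-th distinct ID is glued to the i-th distinct Amount regardless of which rows they came from.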

answered Jan 19, 2016 at 20:05

user330315


3 Comments

Florin Ghita Over a year ago

The problem is easier if you can "break" the rows. I mean, the combination EE 50 does not exist in the input data. The problem as it appears in the OP is very difficult: it may have no solution for some inputs and many solutions for others.

user330315 Over a year ago

@FlorinGhita: ah, good point. I didn't understand it like that. I think the requirement "each value must be unique" can't be satisfied, if you aren't allowed to "break up" the rows.

Florin Ghita Over a year ago

However, I will vote this answer to bring the issue up. ;)
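If rows may not be broken up, the task discussed in these comments can be read as a bipartite matching problem: each ID must be matched to one of its own Amounts, and no Amount may be used twice. The sketch below is not taken from any answer in this thread. It is a plain-Python illustration using the standard augmenting-path technique, and the function name pick_unique_pairs is made up. It returns None when no valid selection exists, which is the "no solution for some inputs" case.

```python
SAMPLE = [("AA", 10), ("AA", 20), ("BB", 30), ("BB", 40), ("CC", 10),
          ("CC", 50), ("DD", 20), ("DD", 60), ("EE", 30), ("EE", 70)]


def pick_unique_pairs(rows):
    """Pick one original (id, amount) row per id so that no amount repeats.

    Returns a dict {id: amount}, or None if some id cannot get an amount.
    """
    candidates = {}
    for row_id, amount in rows:
        candidates.setdefault(row_id, []).append(amount)

    owner = {}  # amount -> id that currently holds it

    def assign(row_id, seen):
        for amount in candidates[row_id]:
            if amount in seen:
                continue
            seen.add(amount)
            # Take a free amount, or move its current owner to another amount.
            if amount not in owner or assign(owner[amount], seen):
                owner[amount] = row_id
                return True
        return False

    for row_id in candidates:
        if not assign(row_id, set()):
            return None
    return {row_id: amount for amount, row_id in owner.items()}


print(pick_unique_pairs(SAMPLE))                    # one valid selection
print(pick_unique_pairs([("BB", 30), ("CC", 30)]))  # None: both ids need 30
```

When several selections are valid, which one comes back depends on the input order, which is consistent with the question's note that selection order doesn't matter.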

Paul Maxwell · Accepted Answer · 2016-01-19 10:57:06Z

0

I suggest using row_number() like this:

select ID ,Amount
from (
   select ID ,Amount, row_number() over(partition by id order by 1) as rn
   from yourtable
     )
where rn = 1

However, your expected results don't follow a discernible order: some are the first/lowest while others are the last/highest, so I wasn't sure what to use for the ordering.

answered Jan 19, 2016 at 10:57

Paul Maxwell


2 Comments

Anton Over a year ago

Unfortunately it didn't work properly: I still get repeating values in the Amount column. Thank you for your time.

Paul Maxwell Over a year ago

sorry, I missed the significance of those words, but now I understand what you need. mmmm
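The repeated values Anton mentions are easy to reproduce on the sample data. The sketch below runs the row_number() query through Python's sqlite3 module (an assumption, used here as a stand-in for Oracle), with one deliberate change: the window is ordered by amount instead of the constant 1, so the chosen row per ID is deterministic. The helper name first_row_per_id is made up for illustration.

```python
import sqlite3
from collections import Counter

ROWS = [("AA", 10), ("AA", 20), ("BB", 30), ("BB", 40), ("CC", 10),
        ("CC", 50), ("DD", 20), ("DD", 60), ("EE", 30), ("EE", 70)]

# Same shape as the answer's query; "order by amount" replaces "order by 1".
QUERY = """
select id, amount
from (
  select id, amount,
         row_number() over (partition by id order by amount) as rn
  from yourtable
)
where rn = 1
order by id
"""


def first_row_per_id():
    """Keep the lowest-amount row for each id."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table yourtable (id varchar(10), amount integer)")
    conn.executemany("insert into yourtable values (?, ?)", ROWS)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


result = first_row_per_id()
counts = Counter(amount for _, amount in result)
repeated = sorted(a for a, n in counts.items() if n > 1)
print("result:  ", result)
print("repeated:", repeated)
```

Partitioning by ID makes the ID column unique, but nothing in the query stops two partitions from picking the same Amount.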

Dmitriy · Accepted Answer · 2016-01-19 13:14:59Z

0

My solution uses a recursive WITH clause and works as follows: first, it selects the minimal values of ID and Amount; then, at every next level, it searches for values of ID and Amount that are greater than those already chosen (this provides uniqueness); finally, the query selects one row for every recursion level. This is not an ultimate solution, though, because it is possible to find a combination of source data for which the query will not work (I suppose that an ultimate solution is impossible, at least in SQL).

with r (id, amount, lvl) as (select min(id), min(amount), 1
             from t
            union all
           select t.id, t.amount, r.lvl + 1
             from t, r
            where t.id > r.id and t.amount > r.amount)
select lvl, min(id), min(amount)
  from r
 group by lvl
 order by lvl

SQL Fiddle

edited Jan 19, 2016 at 13:14

answered Jan 19, 2016 at 12:41

Dmitriy


1 Comment

Anton Over a year ago

The same thing as with @Aleksej's answer: the query time is extremely long and I simply cannot wait until it finishes.
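One plausible reason for the long run time is that every recursion level joins the whole table against every row from the previous level, so identical rows are reached along many paths. The sketch below counts the intermediate rows per level on the ten-row sample. It runs on Python's sqlite3 module (an assumption; SQLite is used here with the RECURSIVE keyword, which Oracle's syntax does not use), and the helper name rows_per_level is made up for illustration.

```python
import sqlite3

ROWS = [("AA", 10), ("AA", 20), ("BB", 30), ("BB", 40), ("CC", 10),
        ("CC", 50), ("DD", 20), ("DD", 60), ("EE", 30), ("EE", 70)]

# The answer's recursive query, plus count(*) to expose the row growth.
QUERY = """
with recursive r (id, amount, lvl) as (
  select min(id), min(amount), 1
  from t
  union all
  select t.id, t.amount, r.lvl + 1
  from t, r
  where t.id > r.id and t.amount > r.amount
)
select lvl, count(*), min(id), min(amount)
from r
group by lvl
order by lvl
"""


def rows_per_level():
    """Return (level, intermediate row count, min id, min amount) tuples."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table t (id varchar(10), amount integer)")
    conn.executemany("insert into t values (?, ?)", ROWS)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


levels = rows_per_level()
for lvl, n_rows, min_id, min_amount in levels:
    print(lvl, n_rows, min_id, min_amount)
```

Even on this small input the recursion produces more intermediate rows than the table holds, and since min(id) and min(amount) are aggregated independently per level, a reported pair need not be an original row.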

Anton · Accepted Answer · 2016-01-19 15:39:46Z

0

I knew that there was an elegant solution! Thanks to a friend of mine for the tip:

select max(ID), mAmount from (
  -- your_table is a placeholder name; TABLE itself is a reserved word
  select ID, max(Amount) mAmount from your_table group by ID
)
group by mAmount;

answered Jan 19, 2016 at 15:39

Anton


1 Comment

Dmitriy Over a year ago

It's elegant, but incorrect :) Try this combination: (BB, 30), (BB, 40), (CC, 30), (CC, 40). My solution is the only one that gives a correct answer for this combination. (In fact, such combinations can be found for my solution too, but this solution is the weakest.)
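The counterexample in this comment can be checked directly. The sketch below runs the max/group by query on those four rows using Python's sqlite3 module (an assumption, standing in for Oracle). The table name data and the helper name run_max_query are made up for illustration.

```python
import sqlite3

# The combination suggested in the comment.
COUNTEREXAMPLE = [("BB", 30), ("BB", 40), ("CC", 30), ("CC", 40)]

# The answer's query, with a hypothetical table name "data".
QUERY = """
select max(id), mAmount from (
  select id, max(amount) mAmount from data group by id
)
group by mAmount
"""


def run_max_query(rows):
    """Run the query against the given rows in an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table data (id varchar(10), amount integer)")
    conn.executemany("insert into data values (?, ?)", rows)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


result = run_max_query(COUNTEREXAMPLE)
print(result)
```

Both IDs have 40 as their maximum, so the outer group by collapses them into a single row and BB disappears, even though a valid selection exists, for example (BB, 30) and (CC, 40).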

Aleksej · Accepted Answer · 2016-01-20 08:26:00Z

0

Maybe something like this can solve it:

WITH tx AS
     (  SELECT ROWNUM ROW_NUMBER,
               t.id,
               t.amount
          FROM test t
               INNER JOIN test t2
                   ON     t.id = t2.id
                      AND t.amount != t2.amount
      ORDER BY t.id)
SELECT tx1.id, tx1.amount
  FROM tx tx1
   LEFT JOIN tx tx2
       ON     tx1.id = tx2.id
          AND tx1.ROW_NUMBER > tx2.ROW_NUMBER
WHERE tx2.ROW_NUMBER IS NULL

edited Jan 20, 2016 at 8:26

answered Jan 19, 2016 at 12:45

Aleksej


3 Comments

Anton Over a year ago

I honestly tried to execute this query, but couldn't: SQL Developer did not finish it even within 5 minutes, which is too long. Thank you for your effort.

user330315 Over a year ago

LEFT JOIN tx tx2 ON tx1.id = tx2.id(+) looks terribly wrong. You should use either a proper left join or Oracle's proprietary (+) operator but not both in the same join condition.

Aleksej Over a year ago

Oops, my error. The (+) operator is wrong; I've just removed it. Thanks.
