Postgres: select all row with count of a field greater than 1

Question

i have table storing product price information, the table looks similar to, (no is the primary key)

no   name    price    date
1    paper   1.99     3-23
2    paper   2.99     5-25
3    paper   1.99     5-29
4    orange  4.56     4-23
5    apple   3.43     3-11

right now I want to select all the rows where the "name" field appeared more than once in the table. Basically, i want my query to return the first three rows.

I tried:

SELECT * FROM product_price_info GROUP BY name HAVING COUNT(*) > 1

but i get an error saying:

column "product_price_info.no" must appear in the GROUP BY clause or be used in an aggregate function

Juan Carlos Oropeza · Accepted Answer · 2016-04-01 14:27:38Z

93

SELECT * 
FROM product_price_info 
WHERE name IN (SELECT name 
               FROM product_price_info 
               GROUP BY name HAVING COUNT(*) > 1)

answered Apr 1, 2016 at 14:27

Juan Carlos Oropeza

48.4k14 gold badges87 silver badges128 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Madbreaks Over a year ago

Better (as in faster) to use e.g. COUNT(id) per the official docs.

Diego Souza Over a year ago

You could join "name" inside the subquery with the group by, and use the "exists" clause instead of "in". I think it's faster

Burst Over a year ago

@Madbreaks, where exactly in documentation?

Giorgos Betsos · Accepted Answer · 2016-04-01 14:27:03Z

13

Try this:

SELECT no, name, price, "date"
FROM (
  SELECT no, name, price, "date",
         COUNT(*) OVER (PARTITION BY name) AS cnt 
  FROM product_price_info ) AS t
WHERE t.cnt > 1

You can use the window version of COUNT to get the population of each name partition. Then, in an outer query, filter out name partitions having a population that is less than 2.

answered Apr 1, 2016 at 14:27

Giorgos Betsos

72.3k10 gold badges69 silver badges103 bronze badges

1 Comment

cyfex Over a year ago

I have tested the versions from Juan Carlos Oropeza, Giorgos Betsos, and jarlh in SQLite3. This version is the fastest one. It's 26% faster than the other two.

Jeff C Johnson · Accepted Answer · 2019-08-28 17:41:45Z

13

Window Functions are really nice for this.

SELECT p.*, count(*) OVER (PARTITION BY name) FROM product p;

For a full example:

CREATE TABLE product (no SERIAL, name text, price NUMERIC(8,2), date DATE);

INSERT INTO product(name, price, date) values
('paper', 1.99, '2017-03-23'),
('paper', 2.99, '2017-05-25'),
('paper', 1.99, '2017-05-29'),
('orange', 4.56, '2017-04-23'),
('apple', 3.43, '2017-03-11')
;

WITH report AS (
  SELECT p.*, count(*) OVER (PARTITION BY name) as count FROM product p
)
SELECT * FROM report WHERE count > 1;

Gives:

 no |  name  | price |    date    | count
----+--------+-------+------------+-------
  1 | paper  |  1.99 | 2017-03-23 |     3
  2 | paper  |  2.99 | 2017-05-25 |     3
  3 | paper  |  1.99 | 2017-05-29 |     3
(3 rows)

edited Aug 28, 2019 at 17:41

answered Sep 25, 2017 at 15:41

Jeff C Johnson

2273 silver badges10 bronze badges

2 Comments

Madbreaks Over a year ago

Nice but this doesn't fully answer op's question

Jeff C Johnson Over a year ago

@Madbreaks thanks for catching that. I updated the answer.

jarlh · Accepted Answer · 2016-04-01 14:27:23Z

3

Self join version, use a sub-query that returns the name's that appears more than once.

select t1.*
from tablename t1
join (select name from tablename group by name having count(*) > 1) t2
  on t1.name = t2.name

Basically the same as IN/EXISTS versions, but probably a bit faster.

answered Apr 1, 2016 at 14:27

jarlh

44.9k8 gold badges52 silver badges68 bronze badges

Comments

Vitaliy Turkevich · Accepted Answer · 2022-07-13 10:48:25Z

1

SELECT name, count(name)
FROM product_price_info
GROUP BY name
HAVING COUNT(name) > 1
LIMIT 3

answered Jul 13, 2022 at 10:48

Vitaliy Turkevich

411 bronze badge

2 Comments

Z4-tier Over a year ago

This doesn't do what the question asks for. It should select all columns for each row where the name value appears more than once. It should not add aggregation or new columns.

Mike Graf Over a year ago

Funny enough this is the answer to the question I asked google and it gave me this SO

Collectives™ on Stack Overflow

Postgres: select all row with count of a field greater than 1

5 Answers 5

3 Comments

1 Comment

2 Comments

Comments

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

3 Comments

1 Comment

2 Comments

Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related