SQL Query to get column values that correspond with MAX value of another column?

Question

Ok, this is my query:

SELECT
  video_category,
  video_url,
  video_date,
  video_title,
  short_description,
  MAX(video_id) 
FROM
  videos
GROUP BY
  video_category

When it pulls the data, I get the correct row for the video_id, but it pulls the first row for each category for the others. So when I get the max result for the video_id of category 1, I get the max ID, but the first row in the table for the url, date, title, and description.

How can I have it pull the other columns that correspond with the max ID result?

Edit: Fixed.

SELECT
    *
FROM
    videos
WHERE
    video_id IN
    (
        SELECT
            DISTINCT
            MAX(video_id)
        FROM
            videos
        GROUP BY
            video_category
    ) 
ORDER BY
    video_category ASC

@carillonator it's not. It's actually redundant, as MAX() will provide a unique result, obviously.
– Atticus, Commented Sep 21, 2012 at 1:34

Community · Accepted Answer · 2023-06-16 15:27:07Z

83

I would try something like this:

SELECT
   s.video_id
   ,s.video_category
   ,s.video_url
   ,s.video_date
   ,s.video_title
   ,s.short_description
FROM videos s
   JOIN (SELECT MAX(video_id) AS id FROM videos GROUP BY video_category) max
      ON s.video_id = max.id

which, when I tested both on a similar table, was quite a lot faster than your own solution.
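To try the join end to end, here is a minimal sketch that runs it against an in-memory SQLite database from Python. SQLite stands in for MySQL here, the sample rows are made up, and the subquery alias is renamed to `m` for readability; only the table and column names come from the question.

```python
import sqlite3

# Hypothetical sample data; only the column names come from the question.
ROWS = [
    (1, 1, "url-1", "2011-07-01", "title-1", "desc-1"),
    (2, 1, "url-2", "2011-07-02", "title-2", "desc-2"),
    (3, 2, "url-3", "2011-07-03", "title-3", "desc-3"),
    (4, 2, "url-4", "2011-07-04", "title-4", "desc-4"),
    (5, 1, "url-5", "2011-07-05", "title-5", "desc-5"),
]


def latest_per_category(rows):
    """Return the row holding the highest video_id in each category."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE videos ("
        " video_id INTEGER PRIMARY KEY, video_category INTEGER,"
        " video_url TEXT, video_date TEXT, video_title TEXT,"
        " short_description TEXT)"
    )
    conn.executemany("INSERT INTO videos VALUES (?, ?, ?, ?, ?, ?)", rows)
    # The derived table finds one max id per category; the join then
    # fetches the full row that owns each of those ids.
    query = """
        SELECT s.video_category, s.video_id, s.video_url, s.video_title
        FROM videos s
        JOIN (SELECT MAX(video_id) AS id
              FROM videos
              GROUP BY video_category) m
          ON s.video_id = m.id
        ORDER BY s.video_category
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


print(latest_per_category(ROWS))
# [(1, 5, 'url-5', 'title-5'), (2, 4, 'url-4', 'title-4')]
```

Every returned column comes from the same row as the maximum id, which is what the original GROUP BY query failed to guarantee.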

edited Jun 16, 2023 at 15:27

CommunityBot

answered Jul 24, 2011 at 16:09

Dalen

5 Comments

Devin Over a year ago

Not sure how it's faster, but I'll use it. Still works, and that's all I care about. Thanks.

Dalen Over a year ago

I tested both on a similar table I have: mine took 0.02s, yours 0.19s :)

Paul Over a year ago

This only works if there is only one max video_id for each video category. Assuming the OP would arbitrarily select among these duplicates, what is the solution to the more general problem?

d0ug7a5 Over a year ago

FWIW this approach helped me with a similar problem, many thanks

Doug Davis May 30 at 14:52

Hello from 2025, when this response just helped me out.

Steven Moseley · Accepted Answer · 2025-02-20 14:16:25Z

44

I recently invented a new technique to simulate Window Functions in MySQL. I call it Scalar-Aggregate Reduction.

This is by far the highest-performance approach and simplest method (in DB engine terms) for accomplishing this, because it requires no joins, no subqueries, and no CTE.

For your query, it would look something like this:

SELECT
  video_category,
  MAX(video_id) AS video_id,
  SUBSTRING(MAX(CONCAT(LPAD(video_id, 11, '0'), video_url)), 12) AS video_url,
  SUBSTRING(MAX(CONCAT(LPAD(video_id, 11, '0'), video_date)), 12) AS video_date,
  SUBSTRING(MAX(CONCAT(LPAD(video_id, 11, '0'), video_title)), 12) AS video_title,
  SUBSTRING(MAX(CONCAT(LPAD(video_id, 11, '0'), short_description)), 12) AS short_description
FROM
  videos
GROUP BY
  video_category

The combination of scalar and aggregate functions does the following:

LPADs the intra-aggregate correlated identifier to allow proper string comparison (e.g. "0009" and "0025" will be properly ranked). I'm LPADDING to 11 characters here assuming an INT primary key. If you use a BIGINT, you will want to increase this to support your table's ordinality. If you're comparing on a DATETIME field (fixed length), no padding is necessary.
CONCATs the padded identifier with the output column (so you get "00000000009myvalue" vs "00000000025othervalue")
Takes the MAX of the aggregate set, which will yield "00000000025othervalue" as the winner.
SUBSTRING the result, which will truncate the compared identifier portion, leaving only the value.
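The four steps above can be sketched outside the database to see why the padding matters. This is a plain-Python illustration of the string logic only, not the answer's SQL; the helper name `last_value` is hypothetical, and it assumes non-negative integer ids.

```python
def last_value(pairs, width=11):
    """Return the value paired with the largest id, using string comparison only.

    pairs: iterable of (non-negative int id, str value) tuples.
    """
    # Steps 1 and 2: left-pad the id with zeros and prepend it to the value.
    tagged = [str(ident).zfill(width) + value for ident, value in pairs]
    # Step 3: string MAX now agrees with numeric order, thanks to the padding.
    winner = max(tagged)
    # Step 4: cut the fixed-width id prefix off again.
    return winner[width:]


pairs = [(9, "myvalue"), (25, "othervalue")]
print(last_value(pairs))  # othervalue

# Without padding, "9..." sorts after "25..." and the wrong row wins.
print(max(str(i) + v for i, v in pairs))  # 9myvalue
```

The slice offset equals the padding width, which mirrors the pairing of `LPAD(..., 11, '0')` with `SUBSTRING(..., 12)` in the query (SQL positions are 1-based).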

If you want to retrieve values in types other than CHAR, you may need to perform an additional CAST on the output, e.g. if you want video_date to be a DATETIME:

CAST(SUBSTRING(MAX(CONCAT(LPAD(video_id, 11, '0'), video_date)), 12) AS DATETIME)

Another benefit of this method over the self-joining method is that you can combine other aggregate data (not just latest values), or even combine first AND last item in the same query, e.g.

SELECT
    -- Overall totals
    video_category,
    COUNT(1) AS videos_in_category,
    DATEDIFF(MAX(video_date), MIN(video_date)) AS timespan,
    
    -- Last video details
    MAX(video_id) AS last_video_id,
    SUBSTRING(MAX(CONCAT(LPAD(video_id, 11, '0'), video_url)), 12) AS last_video_url,
    ...
    
    -- First video details
    MIN(video_id) AS first_video_id,
    SUBSTRING(MIN(CONCAT(LPAD(video_id, 11, '0'), video_url)), 12) AS first_video_url,
    ...
    
    -- And so on
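
A runnable sketch of the combined first-and-last query, under some loud assumptions: it uses Python's built-in SQLite rather than MySQL, so `printf('%011d', ...)`, `||` and `substr` are swapped in for `LPAD`, `CONCAT` and `SUBSTRING`, and the sample rows are invented. Only the table and column names come from the question.

```python
import sqlite3

# Hypothetical sample rows: (video_id, video_category, video_url).
ROWS = [
    (1, 1, "url-1"),
    (2, 1, "url-2"),
    (3, 2, "url-3"),
    (10, 2, "url-10"),
    (5, 1, "url-5"),
]


def first_and_last_urls(rows):
    """One pass over the table: count, last and first video per category."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE videos ("
        " video_id INTEGER PRIMARY KEY, video_category INTEGER, video_url TEXT)"
    )
    conn.executemany("INSERT INTO videos VALUES (?, ?, ?)", rows)
    # SQLite stand-ins: printf pads the id to 11 digits, || concatenates,
    # substr(x, 12) drops the 11-character id prefix.
    query = """
        SELECT
            video_category,
            COUNT(*) AS videos_in_category,
            MAX(video_id) AS last_video_id,
            substr(MAX(printf('%011d', video_id) || video_url), 12) AS last_video_url,
            MIN(video_id) AS first_video_id,
            substr(MIN(printf('%011d', video_id) || video_url), 12) AS first_video_url
        FROM videos
        GROUP BY video_category
        ORDER BY video_category
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


print(first_and_last_urls(ROWS))
# [(1, 3, 5, 'url-5', 1, 'url-1'), (2, 2, 10, 'url-10', 3, 'url-3')]
```

Category 2 mixes a one-digit and a two-digit id on purpose: without the zero padding, "3url-3" would compare greater than "10url-10".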

For further details explaining the benefits of this method vs other older methods, my full blog post is here: https://www.stevenmoseley.com/blog/tech/high-performance-sql-correlated-scalar-aggregate-reduction-queries

edited Feb 20 at 14:16

answered Nov 4, 2019 at 16:55

Steven Moseley

11 Comments

Steven Moseley Over a year ago

FYI, benchmarked on a 1-million row production table in Aurora, Scalar Aggregate Comparison performed 20% better than the Subquery method.

Mzril Over a year ago

This is amazing, and EXACTLY what I was looking for. Thank you for this. I look forward to finding additional use cases for this method.

ARM07470 Over a year ago

I just spent a few hours optimizing a similar query on SQL Server 2014 and was coming on here to report the performance improvements that can be gained from this technique but found that you beat me to it. In my case, I was looking for the first & last date in an activity log along with the person who performed it. The T-SQL expressions I used to find the first action date and person were MIN(ActivityDate) FirstActivityDate and SUBSTRING(MIN(CONVERT(VARCHAR, ActivityDate, 21) + ActivityBy), 24, 256) FirstActivityBy. I got a 3X performance improvement over the CTE method for 130,000 rows.

tuxedobob Over a year ago

How do we get this baked into MySQL? I shouldn’t have to do this myself.

pigi5 Over a year ago

Hah of course, I forgot about the negative sign. Thanks for the great solution!

Lou · Accepted Answer · 2021-06-23 16:39:36Z

11

A slightly more "rustic" solution, but should do the job just the same:

SELECT
  video_category,
  video_url,
  video_date,
  video_title,
  short_description,
  video_id
FROM
  videos
ORDER BY video_id DESC
LIMIT 1;

In other words, just produce a table with all of the columns that you want, sort it so that your maximum value is at the top, and chop it off so you only return one row.
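A quick way to check what this query returns is to run it on a few made-up rows. The sketch below uses Python's built-in SQLite as a stand-in for MySQL and a trimmed-down, hypothetical version of the table. Note that `LIMIT 1` yields a single row for the whole table, not one row per category.

```python
import sqlite3

# Hypothetical sample rows: (video_id, video_category, video_url).
ROWS = [
    (1, 1, "url-1"),
    (2, 1, "url-2"),
    (3, 2, "url-3"),
    (4, 2, "url-4"),
    (5, 1, "url-5"),
]


def newest_row(rows):
    """Sort by video_id descending and keep only the top row."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE videos ("
        " video_id INTEGER PRIMARY KEY, video_category INTEGER, video_url TEXT)"
    )
    conn.executemany("INSERT INTO videos VALUES (?, ?, ?)", rows)
    query = """
        SELECT video_category, video_url, video_id
        FROM videos
        ORDER BY video_id DESC
        LIMIT 1
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


print(newest_row(ROWS))
# [(1, 'url-5', 5)]  -- category 2 does not appear at all
```

This fits when only the single newest row overall is needed; for one row per category, a WHERE clause on a specific category or one of the grouped approaches is still required.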

answered Jun 23, 2021 at 16:39

Lou

1 Comment

G-Force Over a year ago

was a great option for me. simple, elegant, and the sort and limit was perfect. thanks!

Guillaume Massé · Accepted Answer · 2019-04-28 13:30:42Z

5

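Here is a more general solution (handles duplicates). It joins on both the per-group maximum and the grouping column, so a maximum value that also appears in another group is not matched by mistake: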

CREATE TABLE test(
  i INTEGER,
  c INTEGER,
  v INTEGER
);


insert into test(i, c, v)
values
(3, 1, 1),
(3, 2, 2),
(3, 3, 3),
(4, 2, 4),
(4, 3, 5),
(4, 4, 6),
(5, 3, 7),
(5, 4, 8),
(5, 5, 9),
(6, 4, 10),
(6, 5, 11),
(6, 6, 12);



SELECT t.c, t.v
FROM test t
JOIN (SELECT test.c, max(i) as mi FROM test GROUP BY c) j ON
  t.i = j.mi AND
  t.c  = j.c
ORDER BY c;

answered Apr 28, 2019 at 13:30

Guillaume Massé

1 Comment

Thykof Over a year ago

The output is:

c,v
1,1
2,4
3,7
4,10
5,11
6,12

Ghareeb Nawaz · Accepted Answer · 2020-11-18 20:31:11Z

-1

SELECT
  video_category,
  video_url,
  video_date,
  video_title,
  short_description,
  video_id
FROM videos t1
WHERE video_id IN (
  SELECT MAX(video_id)
  FROM videos t2
  WHERE t1.video_category = t2.video_category
);

Please provide your input and output records so that it can be understood properly and tested.

answered Nov 18, 2020 at 20:31

Ghareeb Nawaz

1 Comment

Yunnosch Over a year ago

This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker.
