"SINGLE" aggregation function in SQL Server

Question

Moderately frequently, I find myself doing a grouping, that I know will result in the whole group having the same value in a particular column, but SQL Server doesn't know that.

Most often, it's that I've grouped by DATEPART(Month, my_date_column) and then I want to SELECT DATEPART(Year, my_date_column) where all the data is in a single year or SELECT DATENAME(Month, my_date_column)

SQL Server doesn't know that these are implicitly all identical, so I end up using MIN() or MAX().

This works, but it feels wrong. (And misleading for future developers!)

Is there a SINGLE() function or anything comparable?

Ideally it would error if they weren't all unique, but I'd taking anything that was more explicit about what I was doing.

To document your intentions, just group by it like everything else. — Nick.Mc
– Nick.Mc, Commented Mar 23, 2017 at 10:36
SINGLE is the misleading function. What would SINGLE do if it encountered a different value? It can't throw random errors. It could only work if somehow you ensured that all results were identical, as if you called DISTINCT on them. That's not how aggregates are expected to work — Panagiotis Kanavos
– Panagiotis Kanavos, Commented Mar 23, 2017 at 10:37
SQL, the language, deals with data sets. Aggregate functions work on an entire set and produce a result. It's OK for a function to throw when invalid data are encountered, like NaN or NULL. It's not OK if that happens at random based on the order or distribution of the data. For such a function to work deterministically the rest of the query would have to ensure that all values are identical — Panagiotis Kanavos
– Panagiotis Kanavos, Commented Mar 23, 2017 at 10:44

Damien_The_Unbeliever · Accepted Answer · 2017-03-23 10:13:21Z

1

I just use MIN. There are only 13 aggregate functions and there's nothing that is more suitable.

If you wish to document that the result should be unique for the group and that multiple values are an error, put a tripwire in:

...
MIN(Expression) as a,
CASE WHEN MIN(Expression) != MAX(Expression) THEN 1/0 END as EnsureUnique,
...

The alternative is to write your own CLR Aggregate function for this.

answered Mar 23, 2017 at 10:13

Damien_The_Unbeliever

241k28 gold badges358 silver badges470 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Panagiotis Kanavos Over a year ago

A SQLCLR aggregate that randomly throws after the results are selected isn't the best idea. Which is why SQL, the language, doesn't have such an aggregate function. You can't have an aggregate function that may throw even though the data is perfectly valid (ie not NULL or extremes). It would also cause serious performance problems

Damien_The_Unbeliever Over a year ago

@PanagiotisKanavos - personally, I'd just use the MIN and not be looking to generate errors. Having said that, I'd still prefer any variation of this aggregate over the mysql "it's not in an aggregate, it's not in a GROUP BY, I'll just give you a random value from one of the rows".

Panagiotis Kanavos Over a year ago

You could also use FIRST_VALUE, if it doesn't introduce additional sorting. MySQL doesn't have windowing functions so it uses various tricks like previous + 1 to calculate row numbers

Collectives™ on Stack Overflow

"SINGLE" aggregation function in SQL Server

1 Answer 1

3 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

3 Comments

Your Answer

Sign up or log in

Post as a guest

Related