Finding consecutive patterns (with SQL)

Question

A table consecutive in PostgreSQL: Each se_id has an idx from 0 up to 100 - here 0 to 9.

The search pattern:

SELECT *
FROM consecutive
WHERE val_3_bool = 1
AND val_1_dur > 4100 AND val_1_dur < 5900

Now I'm looking for the longest consecutive appearance of this pattern for each p_id - and the AVG of the counted val_1_dur.

Is it possible to calculate this in pure SQL?

table as txt "Result" as txt

Post the tables and data as text READ THIS to understand why — Juan Carlos Oropeza
– Juan Carlos Oropeza, Commented Nov 6, 2017 at 14:36
stackoverflow.com/questions/tagged/gaps-and-islands+postgresql — user330315
– user330315, Commented Nov 6, 2017 at 14:41

Gordon Linoff · Accepted Answer · 2017-11-06 14:41:47Z

3

One method is the difference of row numbers approach to get the sequences for each:

select pid, count(*) as in_a_row, sum(val1_dur) as dur
from (select t.*,
             row_number() over (partition by pid order by idx) as seqnum,
             row_number() over (partition by pid, val3_bool order by idx) as seqnum_d
      from consecutive t
     ) t
group by (seqnun - seqnum_d), pid, val3_bool;

If you are looking specifically for "1" values, then add where val3_bool = 1 to the outer query. To understand why this works, I would suggest that you stare at the results of the subquery, so you can understand why the difference defines the consecutive values.

You can then get the max using distinct on:

select distinct on (pid) t.*
from (select pid, count(*) as in_a_row, sum(val1_dur) as dur
      from (select t.*,
                   row_number() over (partition by pid order by idx) as seqnum,
                   row_number() over (partition by pid, val3_bool order by idx) as seqnum_d
            from consecutive t
           ) t
      group by (seqnun - seqnum_d), pid, val3_bool;
     ) t
order by pid, in_a_row desc;

The distinct on does not require an additional level of subquery, but I think that makes the logic clearer.

answered Nov 6, 2017 at 14:41

Gordon Linoff

1.3m62 gold badges706 silver badges857 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Teletubbi-OS X Over a year ago

Sounds interesing. ;) But I’m looking only for the consecutive matching of a specific pattern.

Gordon Linoff Over a year ago

@Teletubbi-OSX . . . This does what you want. Just add the where clause to the query before the group by, as explained in the answer.

Teletubbi-OS X Over a year ago

Yes, it works! Thanks. But this procedure is a bit slow. It takes ~30sec. (table size 750MB).

Franco Pan · Accepted Answer · 2017-11-06 14:40:02Z

0

There are Window Functions, that enable you to compare one line with the previous and next one.

https://community.modeanalytics.com/sql/tutorial/sql-window-functions/ https://www.postgresql.org/docs/current/static/tutorial-window.html

As seen on How to compare the current row with next and previous row in PostgreSQL? and Filtering by window function result in Postgresql

edited Nov 6, 2017 at 14:40

user330315

answered Nov 6, 2017 at 14:38

Franco Pan

1482 silver badges12 bronze badges

Collectives™ on Stack Overflow

Finding consecutive patterns (with SQL)

2 Answers 2

3 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

3 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related