PostgreSQL count number of times substring occurs in text

Question

I'm writing a PostgreSQL function to count the number of times a particular text substring occurs in another piece of text. For example, calling count('foobarbaz', 'ba') should return 2.

I understand that to test whether the substring occurs, I use a condition similar to the below:

    WHERE 'foobarbaz' like '%ba%'

However, I need it to return 2 for the number of times 'ba' occurs. How can I proceed?

Thanks in advance for your help.

Check out my answer for an updated method of doing this stackoverflow.com/a/42708237/124486 — Evan Carroll
– Evan Carroll, Commented Mar 10, 2017 at 0:55

Community · Accepted Answer · 2017-04-13 12:42:40Z

I would highly suggest checking out this answer I posted to "How do you count the occurrences of an anchored string using PostgreSQL?". The chosen answer was shown to be massively slower than an adapted version of regexp_replace(). The overhead of creating the rows, and the running the aggregate is just simply too high.

The fastest way to do this is as follows...

SELECT
  (length(str) - length(replace(str, replacestr, '')) )::int
  / length(replacestr)
FROM ( VALUES
  ('foobarbaz', 'ba')
) AS t(str, replacestr);

Here we

Take the length of the string, L1
Subtract from L1 the length of the string with all of the replacements removed L2 to get L3 the difference in string length.
Divide L3 by the length of the replacement to get the occurrences

For comparison that's about five times faster than the method of using regexp_matches() which looks like this.

SELECT count(*)
FROM ( VALUES
  ('foobarbaz', 'ba')
) AS t(str, replacestr)
CROSS JOIN LATERAL regexp_matches(str, replacestr, 'g');

Mike T · Accepted Answer · 2014-09-10 04:47:55Z

10

How about use a regular expression:

SELECT count(*)
FROM regexp_matches('foobarbaz', 'ba', 'g');

The 'g' flag repeats multiple matches on a string (not just the first).

answered Sep 10, 2014 at 4:47

Mike T

44.4k18 gold badges166 silver badges213 bronze badges

1 Comment

Evan Carroll Over a year ago

Check out my answer here for an update to this question and a comparison of both this method and an optimal way of doing this. Or, my answer to another question on DBA.SE, "How do you count the occurrences of an anchored string using PostgreSQL?".

Andreas Covidiot · Accepted Answer · 2016-04-25 09:17:01Z

1

There is a

str_count( src,  occurence )

function based on

SELECT (length( str ) - length(replace( str, occurrence, '' ))) / length( occurence )

and a

str_countm( src, regexp )

based on the @MikeT-mentioned

SELECT count(*) FROM regexp_matches( str, regexp, 'g')

available here: postgres-utils

answered Apr 25, 2016 at 9:17

Andreas Covidiot

4,8335 gold badges55 silver badges106 bronze badges

Comments

atiruz · Accepted Answer · 2016-10-03 20:57:12Z

1

Try with:

SELECT array_length (string_to_array ('1524215121518546516323203210856879', '1'), 1) - 1

--RESULT: 7

answered Oct 3, 2016 at 20:57

atiruz

2,87830 silver badges37 bronze badges

Comments

Kouber Saparev · Accepted Answer · 2025-10-07 11:17:23Z

0

You can use regexp_count.

SELECT regexp_count('foobarbaz', 'ba');

The above command will give you the number 2.

answered Oct 7 at 11:17

Kouber Saparev

8,2552 gold badges32 silver badges28 bronze badges

Collectives™ on Stack Overflow

PostgreSQL count number of times substring occurs in text

5 Answers 5

Comments

1 Comment

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Comments

1 Comment

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related