querying json keys which intersect in postgres jsonb

Question

How do i query for jsonb keys which intersect:

Ex:

     kv                                 |        column1                   
-----------------------------------------------------------
[{"k1": "v1"}, {"k2": "v22"}]           | web
[{"k10": "v5"}, {"k9": "v21"}]          | mobile
[{"k1": "v1"}, {"k5": "v24"}]           | web1
[{"k5": "v1"}, {"k55": "v24"}]          | web1

here, row 1 and row 3 has key k1 and row 3 and row 4 has key k5.

So, the answer should be rows 1,3 & 4.

Why do you have a column named column? Is that "column" related to your question? — user330315
– user330315, Commented Aug 31, 2018 at 7:01

score 1 · Accepted Answer · 2018-08-31 07:49:48Z

Assuming the following setup:

create table data (id serial, kv jsonb, col1 text);

insert into data (kv, col1)
values
('[{"k1": "v1"}, {"k2": "v22"}]', 'web'),
('[{"k10": "v5"}, {"k9": "v21"}]', 'mobile'),
('[{"k1": "v1"}, {"k5": "v24"}]', 'web1'),
('[{"k5": "v1"}, {"k55": "v24"}]', 'web1');

You can get those rows by first normalizing the data, then doing a self join on the normalized data. To normalize the data you need to unnest the JSON values twice: once for flattening the arrays and then another time to extract the keys from the JSON values:

with normalized as (
  select d.id, t2.*
  from data d
    join jsonb_array_elements(kv) as t1(kv) on true
    join jsonb_each_text(t1.kv) as t2(k,val) on true
)
select n1.*
from normalized n1
where exists (select *
              from normalized n2
              where n1.id <> n2.id 
                and n1.k = n2.k);

The above returns:

id | k  | val
---+----+----
 1 | k1 | v1 
 3 | k1 | v1 
 3 | k5 | v24
 4 | k5 | v1

Or use it with an IN condition to get the original rows:

with normalized as (
  select d.id, t2.*
  from data d
    join jsonb_array_elements(kv) as t1(kv) on true
    join jsonb_each_text(t1.kv) as t2(k,val) on true
)
select *
from data
where id in (select n1.id
            from normalized n1
            where exists (select *
                          from normalized n2
                          where n1.id <> n2.id 
                            and n1.k = n2.k))

returns:

id | kv                             | col1
---+--------------------------------+-----
 1 | [{"k1": "v1"}, {"k2": "v22"}]  | web 
 3 | [{"k1": "v1"}, {"k5": "v24"}]  | web1
 4 | [{"k5": "v1"}, {"k55": "v24"}] | web1

This type of query would be easier if you didn't store the key/value pairs in an array, '{"k1": "v1", "k2": "v22"}' would make a lot more sense to me than [{"k1": "v1"}, {"k2": "v22"}]

i agree with how it should be stored. this is an already existing table and has to be backward compatible. cannot migrate to the new structure yet.

Rémy Baron · Accepted Answer · 2018-08-31 08:00:02Z

1

You can try this :

--This part is to simulate your table
with yourTable as (
select (string_to_array(t,'|'))[1]::jsonb kv,(string_to_array(t,'|'))[2] column1 from (
select unnest(string_to_array($$[{"k1": "v1"}, {"k2": "v22"}]           | web
[{"k10": "v5"}, {"k9": "v21"}]          | mobile
[{"k1": "v1"}, {"k5": "v24"}]           | web1
[{"k5": "v1"}, {"k55": "v24"}]          | web1$$::character varying,E'\n')) t

) b
) 
-- This is your request :
   select distinct kv,column1 from (
        select *,count(*) over (partition by elt) nb_inter from (
          select kv,column1,jsonb_object_keys(jsonb_array_elements(kv)) elt from yourTable
          ) a 
        ) b
where nb_inter >1

edited Aug 31, 2018 at 8:00

answered Aug 31, 2018 at 7:27

Rémy Baron

1,4099 silver badges15 bronze badges

Collectives™ on Stack Overflow

querying json keys which intersect in postgres jsonb

2 Answers 2

1 Comment

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Related