Index Help (Postgresql Not Equal Text)

Question

Using Postgresql 9.5, wanting to know the best way to index a table so that the following query will not use a sequential scan.

SELECT *
FROM MYTABLE
WHERE (
COL1 NOT IN ('980','982','983','984','985','986','987','988','989','990','991','996','997')
OR
COL2 <> '999'
)

And why do you think a seq scan is bad? Please edit your question and add the execution plan generated using explain (analyze, verbose). Formatted text please, no screen shots — user330315
– user330315, Commented Feb 16, 2017 at 20:57

Markus Benko · Accepted Answer · 2017-02-16 21:29:00Z

Just define a partial index with exactly the same where clause as used in your select statement:

CREATE INDEX ON mytable ((col1 IS NULL)) WHERE (
    COL1 NOT IN ('980','982','983','984','985','986','987','988','989','990','991','996','997')
    OR
    COL2 <> '999'
)

The expression col1 IS NULL is just a fill-in which yields a single boolean value that is stored in the index because it's compact (just one byte per row) and it's not clear which columns would make sense to be included. If you keep querying all columns and you don't care about occupying disk space by virtually duplicating all data then including all columns in the index could make sense performance-wise.

Edit: Sequential scans are not bad per se. Generally when PostgreSQL favours a sequential scan over using an existing index there's a good reason for it. If not, the configuration could be tuned to reflect its server environment more accurately.

Collectives™ on Stack Overflow

Index Help (Postgresql Not Equal Text)

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related