MySQL DELETE query with conditions

Question

I have a table PEOPLE, with columns 'firstName' 'lastName' (varchars) and 'deleted' (bit) amongst others.

I want to delete from this table, entries that have the property TRUE for deleted, but only if they share their exact firstName and lastName with another, separate, entry in the table.

In other words, remove from the table 'deleted' people, but only if they are a duplicate.

Not sure how to do this, and especially not how to do it quickly. Any help is appreciated, thanks.

How can you tell which is duplicate and which is primary record? — AJ.
– AJ., Commented Jun 27, 2011 at 17:32

niktrs · Accepted Answer · 2011-06-27 17:39:42Z

3

DELETE FROM people
WHERE EXISTS (
    SELECT *
    FROM people p2
    WHERE people.firstName = p2.firstName AND people.lastName = p2.lastName
    GROUP BY firstName, lastName
    HAVING COUNT(*)>1
)
AND deleted = 1 -- True

answered Jun 27, 2011 at 17:39

niktrs

10.1k1 gold badge32 silver badges32 bronze badges

Sign up to request clarification or add additional context in comments.

9 Comments

Dirk Over a year ago

do you need the where clause in the nested statement?

niktrs Over a year ago

Yes, to join the subquery with the outer table.

Naftali Over a year ago

@niktrs, this will take a loooong time is the table is huge.

Abhay Over a year ago

won't this query delete all entries that have deleted = 1?

niktrs Over a year ago

We are asked "I want to delete from this table, entries that have the property TRUE for deleted, but only if they share their exact firstName and lastName with another", so we want deleted=1 and every lastName, firstName count > 1

|

Clockwork-Muse · Accepted Answer · 2014-06-12 08:37:11Z

1

If your table has a unique primary key (... will depend on design...), then this is a viable alternative to needing to count the occurrances of entries:

DELETE FROM people as A
WHERE deleted = 1
AND EXISTS (SELECT '1'
            FROM people as B
            WHERE B.id <> A.id
            AND A.firstName = B.firstName
            AND A.lastName = B.lastName)

This may have slightly better performance than counting rows. Please note that this query will likely suffer the same possible issue present in the previous answer; specifically, if there are two or more 'deleted' rows, and no 'non-deleted', both of them will probably be removed (leaving you with no rows!). If the intent of the query is only to remove 'deleted' rows when there is a 'non-deleted' equivalent row, add AND B.deleted = 0 as part of the inner WHERE clause.

edited Jun 12, 2014 at 8:37

answered Jun 27, 2011 at 21:07

Clockwork-Muse

13.2k6 gold badges33 silver badges50 bronze badges

3 Comments

Chris Cunningham Over a year ago

Great -- this one allows an easy AND B.deleted = 0 fix that I suspect the questioner wants, where the other doesn't.

niktrs Over a year ago

Suggestion: Use A.id>B.id, so everything newer than the first record will be deleted. Also performs faster.

Clockwork-Muse Over a year ago

@niktrs - Unfortunately, that presumes that only later (or earlier) ids are ever 'deleted'. Depending on design and use, this assumption may or may not be valid. But yes, otherwise, that would likely perform better.

Naftali · Accepted Answer · 2011-06-27 17:31:50Z

0

Here is a rudimentary way of doing it:

http://www.justin-cook.com/wp/2006/12/12/remove-duplicate-entries-rows-a-mysql-database-table/

Basically:
1. Create a new table with GROUP BY.
2. Delete old table.
3. Rename new table.

answered Jun 27, 2011 at 17:31

Naftali

147k41 gold badges247 silver badges304 bronze badges

Collectives™ on Stack Overflow

MySQL DELETE query with conditions

3 Answers 3

9 Comments

3 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

9 Comments

3 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related