SQL: Delete Duplicates Without SELECT Using a Cartesian Product
Sample Situation
Delete Duplicate Emails - LeetCode
Table: Person
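+-------------+---------+
| Column Name | Type    |
+-------------+---------+
| id          | int     |
| email       | varchar |
+-------------+---------+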
Write an SQL query to delete all the duplicate emails, keeping only one unique email with the smallest id. Note that you are supposed to write a DELETE statement and not a SELECT one.
After running your script, the answer shown is the Person table. The driver will first compile and run your piece of code and then show the Person table. The final order of the Person table does not matter.
The query result format is in the following example.
Example 1:
Input:
Person table:
+----+------------------+
| id | email            |
+----+------------------+
| 1  | john@example.com |
| 2  | bob@example.com  |
| 3  | john@example.com |
+----+------------------+
Output:
+----+------------------+
| id | email            |
+----+------------------+
| 1  | john@example.com |
| 2  | bob@example.com  |
+----+------------------+
Explanation: john@example.com is repeated two times. We keep the row with the smallest Id = 1.
Solution
DELETE p1
FROM Person p1, Person p2
WHERE p1.email = p2.email AND p1.id > p2.id;
Explanation
Input:
Person table:
+----+------------------+
| id | email            |
+----+------------------+
| 1  | john@example.com |
| 2  | bob@example.com  |
| 3  | john@example.com |
+----+------------------+
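=> From Person p1, Person p2 (Cartesian product): every row of p1 is paired with every row of p2, so 3 x 3 = 9 rows come back. It would look like:
+-------+------------------+-------+------------------+
| p1.id | p1.email         | p2.id | p2.email         |
+-------+------------------+-------+------------------+
| 1     | john@example.com | 1     | john@example.com |
| 1     | john@example.com | 2     | bob@example.com  |
| 1     | john@example.com | 3     | john@example.com |
| 2     | bob@example.com  | 1     | john@example.com |
| 2     | bob@example.com  | 2     | bob@example.com  |
| 2     | bob@example.com  | 3     | john@example.com |
| 3     | john@example.com | 1     | john@example.com |
| 3     | john@example.com | 2     | bob@example.com  |
| 3     | john@example.com | 3     | john@example.com |
+-------+------------------+-------+------------------+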
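To see the Cartesian product for yourself, here is a minimal sketch that runs the comma join against the example data. It assumes an in-memory SQLite database driven from Python, which is only a convenient stand-in for whatever engine you use; the `ORDER BY` is added purely to make the output stable.

```python
import sqlite3

# In-memory SQLite database holding the example Person table (an assumption
# made for this sketch; the post itself does not name an engine here).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Person (id INTEGER PRIMARY KEY, email TEXT)")
conn.executemany(
    "INSERT INTO Person (id, email) VALUES (?, ?)",
    [(1, "john@example.com"), (2, "bob@example.com"), (3, "john@example.com")],
)

# Listing two tables with no join condition pairs every p1 row with every p2 row.
pairs = conn.execute(
    "SELECT p1.id, p1.email, p2.id, p2.email "
    "FROM Person p1, Person p2 "
    "ORDER BY p1.id, p2.id"
).fetchall()

for row in pairs:
    print(row)
print(len(pairs), "pairs")
```

With three rows in Person, the query returns nine pairs, including each row paired with itself.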
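=> From Person p1, Person p2 where p1.email=p2.email and p1.id>p2.id:
Only the pairs that share an email and whose p1 id is larger than the p2 id remain. Here that is a single pair (p1.id = 3, p2.id = 1), so the p1 side would look like:
3|john@example.com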
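The filter can be checked the same way. This sketch again assumes an in-memory SQLite database driven from Python, and selects `p2.id` as well so you can see which lower-id row each duplicate was matched against.

```python
import sqlite3

# Rebuild the example Person table in memory (SQLite is an assumption of this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Person (id INTEGER PRIMARY KEY, email TEXT)")
conn.executemany(
    "INSERT INTO Person (id, email) VALUES (?, ?)",
    [(1, "john@example.com"), (2, "bob@example.com"), (3, "john@example.com")],
)

# Keep only pairs with the same email where the p1 row has the larger id.
# A row never matches itself, because p1.id > p2.id is strict.
matches = conn.execute(
    "SELECT p1.id, p1.email, p2.id "
    "FROM Person p1, Person p2 "
    "WHERE p1.email = p2.email AND p1.id > p2.id"
).fetchall()

print(matches)
```

The strict `>` matters: with `>=`, every row would pair with itself and be marked as a duplicate.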
Now delete the p1 row of each remaining pair by using p1 as the delete target: delete p1. Row 3 is removed, and every email keeps its row with the smallest id.
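If you want to try the final step locally, note that the multi-table `DELETE p1 FROM ...` form is accepted by MySQL but not by SQLite. The sketch below is therefore a stand-in rather than the statement from this post: it feeds the same self-join through an `IN` subquery so that it runs on Python's built-in `sqlite3` module. The subquery does contain a SELECT, so treat it only as a way to verify which rows the join marks for deletion.

```python
import sqlite3

# Rebuild the example Person table in memory (SQLite is an assumption of this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Person (id INTEGER PRIMARY KEY, email TEXT)")
conn.executemany(
    "INSERT INTO Person (id, email) VALUES (?, ?)",
    [(1, "john@example.com"), (2, "bob@example.com"), (3, "john@example.com")],
)

# SQLite stand-in for the multi-table delete:
#   DELETE p1 FROM Person p1, Person p2
#   WHERE p1.email = p2.email AND p1.id > p2.id
# The subquery is the same self-join and yields the ids of the p1 rows to remove.
cur = conn.execute(
    "DELETE FROM Person WHERE id IN ("
    "  SELECT p1.id FROM Person p1, Person p2"
    "  WHERE p1.email = p2.email AND p1.id > p2.id"
    ")"
)
deleted = cur.rowcount

remaining = conn.execute("SELECT id, email FROM Person ORDER BY id").fetchall()
print(deleted, remaining)
```

An email that appears three or more times is handled by the same condition, because every copy except the one with the smallest id has at least one lower-id partner.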