SO LÖSCHEN SIE DOPPELTE ZEILEN IN SQL

In diesem Abschnitt lernen wir verschiedene Möglichkeiten kennen, doppelte Zeilen zu löschen MySQL und Oracle . Wenn die SQL Wenn die Tabelle doppelte Zeilen enthält, müssen wir die doppelten Zeilen entfernen.

Beispieldaten vorbereiten

Das Skript erstellt die Tabelle mit dem Namen Kontakte .

 DROP TABLE IF EXISTS contacts; CREATE TABLE contacts ( id INT PRIMARY KEY AUTO_INCREMENT, first_name VARCHAR(30) NOT NULL, last_name VARCHAR(25) NOT NULL, email VARCHAR(210) NOT NULL, age VARCHAR(22) NOT NULL );

In die obige Tabelle haben wir die folgenden Daten eingefügt.

 INSERT INTO contacts (first_name,last_name,email,age) VALUES (&apos;Kavin&apos;,&apos;Peterson&apos;,&apos;[email protected]&apos;,&apos;21&apos;), (&apos;Nick&apos;,&apos;Jonas&apos;,&apos;[email protected]&apos;,&apos;18&apos;), (&apos;Peter&apos;,&apos;Heaven&apos;,&apos;[email protected]&apos;,&apos;23&apos;), (&apos;Michal&apos;,&apos;Jackson&apos;,&apos;[email protected]&apos;,&apos;22&apos;), (&apos;Sean&apos;,&apos;Bean&apos;,&apos;[email protected]&apos;,&apos;23&apos;), (&apos;Tom &apos;,&apos;Baker&apos;,&apos;[email protected]&apos;,&apos;20&apos;), (&apos;Ben&apos;,&apos;Barnes&apos;,&apos;[email protected]&apos;,&apos;17&apos;), (&apos;Mischa &apos;,&apos;Barton&apos;,&apos;[email protected]&apos;,&apos;18&apos;), (&apos;Sean&apos;,&apos;Bean&apos;,&apos;[email protected]&apos;,&apos;16&apos;), (&apos;Eliza&apos;,&apos;Bennett&apos;,&apos;[email protected]&apos;,&apos;25&apos;), (&apos;Michal&apos;,&apos;Krane&apos;,&apos;[email protected]&apos;,&apos;25&apos;), (&apos;Peter&apos;,&apos;Heaven&apos;,&apos;[email protected]&apos;,&apos;20&apos;), (&apos;Brian&apos;,&apos;Blessed&apos;,&apos;[email protected]&apos;,&apos;20&apos;); (&apos;Kavin&apos;,&apos;Peterson&apos;,&apos;[email protected]&apos;,&apos;30&apos;),

Wir führen das Skript aus, um Testdaten nach der Ausführung von a neu zu erstellen LÖSCHEN Stellungnahme .

Die Abfrage gibt Daten aus der Kontakttabelle zurück:

 SELECT * FROM contacts ORDER BY email;

Ausweis	Vorname	Familienname, Nachname	Email	Alter
7	Ben	Barnes	[email protected]	einundzwanzig
13	Brian	Gesegnet	[email protected]	18
10	Eliza	Bennett	[email protected]	23
1	Kavin	Peterson	[email protected]	22
14	Kavin	Peterson	[email protected]	23
8	Mischa	Barton	[email protected]	zwanzig
elf	Michael	Wasserhähne	[email protected]	17
4	Michael	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
3	Peter	Himmel	[email protected]	25
12	Peter	Himmel	[email protected]	25
5	Sean	Bohne	[email protected]	zwanzig
9	Sean	Bohne	[email protected]	zwanzig
6	Tom	Bäcker	[email protected]	30

Die folgende SQL-Abfrage gibt die doppelten E-Mails aus der Kontakttabelle zurück:

 SELECT email, COUNT(email) FROM contacts GROUP BY email HAVING COUNT (email) &gt; 1;

Email	COUNT(E-Mail)
[email protected]	2
[email protected]	2
[email protected]	2

Wir haben drei Reihen mit Duplikat E-Mails.

(A) Löschen Sie doppelte Zeilen mit der DELETE JOIN-Anweisung

 DELETE t1 FROM contacts t1 INNERJOIN contacts t2 WHERE t1.id <t2.id and t1.email="t2.email;" < pre> <p> <strong>Output:</strong> </p> <pre> Query OK, three rows affected (0.10 sec) </pre> <p>Three rows had been deleted. We execute the query, given below to finds the <strong>duplicate emails</strong> from the table.</p> <pre> SELECT email, COUNT (email) FROM contacts GROUP BY email HAVING COUNT (email) &gt; 1; </pre> <p>The query returns the empty set. To verify the data from the contacts table, execute the following SQL query:</p> <pre> SELECT * FROM contacts; </pre> <br> <table class="table"> <tr> <td>id</td> <td>first_name</td> <td>last_name</td> <td>Email</td> <td>age</td> </tr> <tr> <td>7</td> <td>Ben</td> <td>Barnes</td> <td> [email protected] </td> <td>21</td> </tr> <tr> <td>13</td> <td>Brian</td> <td>Blessed</td> <td> [email protected] </td> <td>18</td> </tr> <tr> <td>10</td> <td>Eliza</td> <td>Bennett</td> <td> [email protected] </td> <td>23</td> </tr> <tr> <td>1</td> <td>Kavin</td> <td>Peterson</td> <td> [email protected] </td> <td>22</td> </tr> <tr> <td>8</td> <td>Mischa</td> <td>Barton</td> <td> [email protected] </td> <td>20</td> </tr> <tr> <td>11</td> <td>Micha</td> <td>Krane</td> <td> [email protected] </td> <td>17</td> </tr> <tr> <td>4</td> <td>Michal</td> <td>Jackson</td> <td> [email protected] </td> <td>18</td> </tr> <tr> <td>2</td> <td>Nick</td> <td>Jonas</td> <td> [email protected] </td> <td>16</td> </tr> <tr> <td>3</td> <td>Peter</td> <td>Heaven</td> <td> [email protected] </td> <td>25</td> </tr> <tr> <td>5</td> <td>Sean</td> <td>Bean</td> <td> [email protected] </td> <td>20</td> </tr> <tr> <td>6</td> <td>Tom</td> <td>Baker</td> <td> [email protected] </td> <td>30</td> </tr> </table> <p>The rows <strong>id&apos;s 9, 12, and 14</strong> have been deleted. We use the below statement to delete the duplicate rows:</p> <p>Execute the script for <strong>creating</strong> the contact.</p> <pre> DELETE c1 FROM contacts c1 INNERJ OIN contacts c2 WHERE c1.id &gt; c2.id AND c1.email = c2.email; </pre> <br> <table class="table"> <tr> <td>id</td> <td>first_name</td> <td>last_name</td> <td>email</td> <td>age</td> </tr> <tr> <td>1</td> <td>Ben</td> <td>Barnes</td> <td> [email protected] </td> <td>21</td> </tr> <tr> <td>2</td> <td> <strong>Kavin</strong> </td> <td> <strong>Peterson</strong></td> <td> <strong> [email protected] </strong> </td> <td> <strong>22</strong> </td> </tr> <tr> <td>3</td> <td>Brian</td> <td>Blessed</td> <td> [email protected] </td> <td>18</td> </tr> <tr> <td>4</td> <td>Nick</td> <td>Jonas</td> <td> [email protected] </td> <td>16</td> </tr> <tr> <td>5</td> <td>Michal</td> <td>Krane</td> <td> [email protected] </td> <td>17</td> </tr> <tr> <td>6</td> <td>Eliza</td> <td>Bennett</td> <td> [email protected] </td> <td>23</td> </tr> <tr> <td>7</td> <td>Michal</td> <td>Jackson</td> <td> [email protected] </td> <td>18</td> </tr> <tr> <td>8</td> <td> <strong>Sean</strong> </td> <td> <strong>Bean</strong> </td> <td> <strong> [email protected] </strong> </td> <td> <strong>20</strong> </td> </tr> <tr> <td>9</td> <td>Mischa</td> <td>Barton</td> <td> [email protected] </td> <td>20</td> </tr> <tr> <td>10</td> <td> <strong>Peter</strong> </td> <td> <strong>Heaven</strong> </td> <td> <strong> [email protected] </strong> </td> <td> <strong>25</strong> </td> </tr> <tr> <td>11</td> <td>Tom</td> <td>Baker</td> <td> [email protected] </td> <td>30</td> </tr> </table> <h2>(B) Delete duplicate rows using an intermediate table</h2> <p>To delete a duplicate row by using the intermediate table, follow the steps given below:</p> <p> <strong>Step 1</strong> . Create a new table <strong>structure</strong> , same as the real table:</p> <pre> CREATE TABLE source_copy LIKE source; </pre> <p> <strong>Step 2</strong> . Insert the distinct rows from the original schedule of the database:</p> <pre> INSERT INTO source_copy SELECT * FROM source GROUP BY col; </pre> <p> <strong>Step 3</strong> . Drop the original table and rename the immediate table to the original one.</p> <pre> DROP TABLE source; ALTER TABLE source_copy RENAME TO source; </pre> <p>For example, the following statements delete the <strong>rows</strong> with <strong>duplicate</strong> emails from the contacts table:</p> <pre> -- step 1 CREATE TABLE contacts_temp LIKE contacts; -- step 2 INSERT INTO contacts_temp SELECT * FROM contacts GROUP BY email; -- step 3 DROP TABLE contacts; ALTER TABLE contacts_temp RENAME TO contacts; </pre> <h2>(C) Delete duplicate rows using the ROW_NUMBER() Function</h2> <h4>Note: The ROW_NUMBER() function has been supported since MySQL version 8.02, so we should check our MySQL version before using the function.</h4> <p>The following statement uses the <strong>ROW_NUMBER ()</strong> to assign a sequential integer to every row. If the email is duplicate, the row will higher than one.</p> <pre> SELECT id, email, ROW_NUMBER() OVER (PARTITION BY email ORDER BY email ) AS row_num FROM contacts; </pre> <p>The following SQL query returns <strong>id list</strong> of the duplicate rows:</p> <pre> SELECT id FROM (SELECT id, ROW_NUMBER() OVER ( PARTITION BY email ORDER BY email) AS row_num FROM contacts ) t WHERE row_num&gt; 1; </pre> <p> <strong>Output:</strong> </p> <table class="table"> <tr> <td>id</td> </tr> <tr> <td>9</td> </tr> <tr> <td>12</td> </tr> <tr> <td>14</td> </tr> </table> <h2>Delete Duplicate Records in Oracle</h2> <p>When we found the duplicate records in the table, we had to delete the unwanted copies to keep our data clean and unique. If a table has duplicate rows, we can delete it by using the <strong>DELETE</strong> statement.</p> <p>In the case, we have a column, which is not the part of <strong>group</strong> used to <strong>evaluate</strong> the <strong>duplicate</strong> records in the table.</p> <p>Consider the table given below:</p> <table class="table"> <tr> <td>VEGETABLE_ID</td> <td>VEGETABLE_NAME</td> <td>COLOR</td> </tr> <tr> <td>01</td> <td>Potato</td> <td>Brown</td> </tr> <tr> <td>02</td> <td>Potato</td> <td>Brown</td> </tr> <tr> <td>03</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>04</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>05</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>06</td> <td>Pumpkin</td> <td>Green</td> </tr> <tr> <td>07</td> <td>Pumpkin</td> <td>Yellow</td> </tr> </table> <br> <pre> -- create the vegetable table CREATE TABLE vegetables ( VEGETABLE_ID NUMBER generated BY DEFAULT AS ID ENTITY, VEGETABLE_NAME VARCHAR2(100), color VARCHAR2(20), PRIMARY KEY (VEGETABLE_ID) ); </pre> <br> <pre> -- insert sample rows INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Pumpkin&apos;,&apos;Green&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Pumpkin&apos;,&apos;Yellow&apos;); </pre> <br> <pre> -- query data from the vegetable table SELECT * FROM vegetables; </pre> <p>Suppose, we want to keep the row with the highest <strong>VEGETABLE_ID</strong> and delete all other copies.</p> <pre> SELECT MAX (VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color ORDER BY MAX(VEGETABLE_ID); </pre> <br> <table class="table"> <tr> <td>MAX(VEGETABLE_ID)</td> </tr> <tr> <td>2</td> </tr> <tr> <td>5</td> </tr> <tr> <td>6</td> </tr> <tr> <td>7</td> </tr> </table> <p>We use the <strong>DELETE</strong> statement to delete the rows whose values in the <strong>VEGETABLE_ID COLUMN</strong> are not the <strong>highest</strong> .</p> <pre> DELETE FROM vegetables WHERE VEGETABLE_IDNOTIN ( SELECT MAX(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color ); </pre> <p>Three rows have been deleted.</p> <pre> SELECT *FROM vegetables; </pre> <br> <table class="table"> <tr> <td>VEGETABLE_ID</td> <td>VEGETABLE_NAME</td> <td>COLOR</td> </tr> <tr> <td> <strong>02</strong> </td> <td>Potato</td> <td>Brown</td> </tr> <tr> <td> <strong>05</strong> </td> <td>Onion</td> <td>Red</td> </tr> <tr> <td> <strong>06</strong> </td> <td>Pumpkin</td> <td>Green</td> </tr> <tr> <td> <strong>07</strong> </td> <td><pumpkin td> <td>Yellow</td> </pumpkin></td></tr> </table> <p>If we want to keep the row with the lowest id, use the <strong>MIN()</strong> function instead of the <strong>MAX()</strong> function.</p> <pre> DELETE FROM vegetables WHERE VEGETABLE_IDNOTIN ( SELECT MIN(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color ); </pre> <p>The above method works if we have a column that is not part of the group for evaluating duplicate. If all values in the columns have copies, then we cannot use the <strong>VEGETABLE_ID</strong> column.</p> <p>Let&apos;s drop and create the <strong>vegetable</strong> table with a new structure.</p> <pre> DROP TABLE vegetables; CREATE TABLE vegetables ( VEGETABLE_ID NUMBER, VEGETABLE_NAME VARCHAR2(100), Color VARCHAR2(20) ); </pre> <br> <pre> INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(1,&apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(1, &apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color)VALUES(2,&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color)VALUES(2,&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(2,&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(3,&apos;Pumpkin&apos;,&apos;Green&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(&apos;4,Pumpkin&apos;,&apos;Yellow&apos;); SELECT * FROM vegetables; </pre> <br> <table class="table"> <tr> <td>VEGETABLE_ID</td> <td>VEGETABLE_NAME</td> <td>COLOR</td> </tr> <tr> <td>01</td> <td>Potato</td> <td>Brown</td> </tr> <tr> <td>01</td> <td>Potato</td> <td>Brown</td> </tr> <tr> <td>02</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>02</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>02</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>03</td> <td>Pumpkin</td> <td>Green</td> </tr> <tr> <td>04</td> <td>Pumpkin</td> <td>Yellow</td> </tr> </table> <p>In the vegetable table, the values in all columns <strong>VEGETABLE_ID, VEGETABLE_NAME</strong> , and color have been copied.</p> <p>We can use the <strong>rowid</strong> , a locator that specifies where Oracle stores the row. Because the <strong>rowid</strong> is unique so that we can use it to remove the duplicates rows.</p> <pre> DELETE FROM Vegetables WHERE rowed NOT IN ( SELECT MIN(rowid) FROM vegetables GROUP BY VEGETABLE_ID, VEGETABLE_NAME, color ); </pre> <p>The query verifies the deletion operation:</p> <pre> SELECT * FROM vegetables; </pre> <br> <table class="table"> <tr> <td>VEGETABLE_ID</td> <td>VEGETABLE_NAME</td> <td>COLOR</td> </tr> <tr> <td>01</td> <td>Potato</td> <td>Brown</td> </tr> <tr> <td>02</td> <td>Onion</td> <td>Red</td> </tr> <tr> <td>03</td> <td>Pumpkin</td> <td>Green</td> </tr> <tr> <td>04</td> <td>Pumpkin</td> <td>Yellow</td> </tr> </table> <hr></t2.id>

Drei Zeilen wurden gelöscht. Wir führen die unten angegebene Abfrage aus, um die zu finden doppelte E-Mails vom Tisch.

 SELECT email, COUNT (email) FROM contacts GROUP BY email HAVING COUNT (email) &gt; 1;

Die Abfrage gibt die leere Menge zurück. Um die Daten aus der Kontakttabelle zu überprüfen, führen Sie die folgende SQL-Abfrage aus:

 SELECT * FROM contacts;

Ausweis	Vorname	Familienname, Nachname	Email	Alter
7	Ben	Barnes	[email protected]	einundzwanzig
13	Brian	Gesegnet	[email protected]	18
10	Eliza	Bennett	[email protected]	23
1	Kavin	Peterson	[email protected]	22
8	Mischa	Barton	[email protected]	zwanzig
elf	Michael	Wasserhähne	[email protected]	17
4	Michael	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
3	Peter	Himmel	[email protected]	25
5	Sean	Bohne	[email protected]	zwanzig
6	Tom	Bäcker	[email protected]	30

Die Reihen IDs 9, 12 und 14 wurden gelöscht. Wir verwenden die folgende Anweisung, um die doppelten Zeilen zu löschen:

Führen Sie das Skript aus für Erstellen der Kontakt.

Was ist Clustering?

 DELETE c1 FROM contacts c1 INNERJ OIN contacts c2 WHERE c1.id &gt; c2.id AND c1.email = c2.email;

Ausweis	Vorname	Familienname, Nachname	Email	Alter
1	Ben	Barnes	[email protected]	einundzwanzig
2	Kavin	Peterson	[email protected]	22
3	Brian	Gesegnet	[email protected]	18
4	Nick	Jonas	[email protected]	16
5	Michael	Wasserhähne	[email protected]	17
6	Eliza	Bennett	[email protected]	23
7	Michael	Jackson	[email protected]	18
8	Sean	Bohne	[email protected]	zwanzig
9	Mischa	Barton	[email protected]	zwanzig
10	Peter	Himmel	[email protected]	25
elf	Tom	Bäcker	[email protected]	30

(B) Löschen Sie doppelte Zeilen mithilfe einer Zwischentabelle

Um eine doppelte Zeile mithilfe der Zwischentabelle zu löschen, führen Sie die folgenden Schritte aus:

Schritt 1 . Erstellen Sie eine neue Tabelle Struktur , genau wie die echte Tabelle:

 CREATE TABLE source_copy LIKE source;

Schritt 2 . Fügen Sie die unterschiedlichen Zeilen aus dem ursprünglichen Zeitplan der Datenbank ein:

 INSERT INTO source_copy SELECT * FROM source GROUP BY col;

Schritt 3 . Löschen Sie die ursprüngliche Tabelle und benennen Sie die unmittelbare Tabelle in die ursprüngliche um.

 DROP TABLE source; ALTER TABLE source_copy RENAME TO source;

Die folgenden Anweisungen löschen beispielsweise die Reihen mit Duplikat E-Mails aus der Kontakttabelle:

 -- step 1 CREATE TABLE contacts_temp LIKE contacts; -- step 2 INSERT INTO contacts_temp SELECT * FROM contacts GROUP BY email; -- step 3 DROP TABLE contacts; ALTER TABLE contacts_temp RENAME TO contacts;

(C) Löschen Sie doppelte Zeilen mit der Funktion ROW_NUMBER()

Hinweis: Die Funktion ROW_NUMBER() wird seit MySQL-Version 8.02 unterstützt, daher sollten wir unsere MySQL-Version überprüfen, bevor wir die Funktion verwenden.

Die folgende Anweisung verwendet die ZEILENNUMMER () um jeder Zeile eine sequentielle Ganzzahl zuzuweisen. Wenn die E-Mail doppelt vorhanden ist, ist die Zeile höher als eins.

 SELECT id, email, ROW_NUMBER() OVER (PARTITION BY email ORDER BY email ) AS row_num FROM contacts;

Die folgende SQL-Abfrage gibt zurück ID-Liste der doppelten Zeilen:

 SELECT id FROM (SELECT id, ROW_NUMBER() OVER ( PARTITION BY email ORDER BY email) AS row_num FROM contacts ) t WHERE row_num&gt; 1;

Ausgabe:

Ausweis

Löschen Sie doppelte Datensätze in Oracle

Als wir die doppelten Datensätze in der Tabelle fanden, mussten wir die unerwünschten Kopien löschen, um unsere Daten sauber und eindeutig zu halten. Wenn eine Tabelle doppelte Zeilen enthält, können wir sie mithilfe von löschen LÖSCHEN Stellungnahme.

In dem Fall haben wir eine Spalte, die nicht Teil davon ist Gruppe gewöhnt an auswerten Die Duplikat Datensätze in der Tabelle.

Betrachten Sie die folgende Tabelle:

VEGETABLE_ID	VEGETABLE_NAME	FARBE
01	Kartoffel	Braun
02	Kartoffel	Braun
03	Zwiebel	Rot
04	Zwiebel	Rot
05	Zwiebel	Rot
06	Kürbis	Grün
07	Kürbis	Gelb

 -- create the vegetable table CREATE TABLE vegetables ( VEGETABLE_ID NUMBER generated BY DEFAULT AS ID ENTITY, VEGETABLE_NAME VARCHAR2(100), color VARCHAR2(20), PRIMARY KEY (VEGETABLE_ID) );

 -- insert sample rows INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Pumpkin&apos;,&apos;Green&apos;); INSERT INTO vegetables (VEGETABLE_NAME,color) VALUES(&apos;Pumpkin&apos;,&apos;Yellow&apos;);

 -- query data from the vegetable table SELECT * FROM vegetables;

Angenommen, wir möchten die Zeile mit der höchsten Position behalten VEGETABLE_ID und löschen Sie alle anderen Kopien.

 SELECT MAX (VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color ORDER BY MAX(VEGETABLE_ID);

MAX(VEGETABLE_ID)

Wir benutzen das LÖSCHEN Anweisung zum Löschen der Zeilen, deren Werte in der VEGETABLE_ID-Spalte sind nicht die höchste .

 DELETE FROM vegetables WHERE VEGETABLE_IDNOTIN ( SELECT MAX(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color );

Drei Zeilen wurden gelöscht.

 SELECT *FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	FARBE
02	Kartoffel	Braun
05	Zwiebel	Rot
06	Kürbis	Grün
07		Gelb

Wenn wir die Zeile mit der niedrigsten ID behalten möchten, verwenden Sie die MINDEST() Funktion anstelle der MAX() Funktion.

 DELETE FROM vegetables WHERE VEGETABLE_IDNOTIN ( SELECT MIN(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color );

Die obige Methode funktioniert, wenn wir eine Spalte haben, die nicht Teil der Gruppe zur Auswertung von Duplikaten ist. Wenn alle Werte in den Spalten Kopien haben, können wir diese nicht verwenden VEGETABLE_ID Spalte.

Lassen Sie uns fallen und erstellen Gemüse Tisch mit neuer Struktur.

 DROP TABLE vegetables; CREATE TABLE vegetables ( VEGETABLE_ID NUMBER, VEGETABLE_NAME VARCHAR2(100), Color VARCHAR2(20) );

 INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(1,&apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(1, &apos;Potato&apos;,&apos;Brown&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color)VALUES(2,&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color)VALUES(2,&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(2,&apos;Onion&apos;,&apos;Red&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(3,&apos;Pumpkin&apos;,&apos;Green&apos;); INSERT INTO vegetables (VEGETABLE_ID,VEGETABLE_NAME,color) VALUES(&apos;4,Pumpkin&apos;,&apos;Yellow&apos;); SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	FARBE
01	Kartoffel	Braun
01	Kartoffel	Braun
02	Zwiebel	Rot
02	Zwiebel	Rot
02	Zwiebel	Rot
03	Kürbis	Grün
04	Kürbis	Gelb

In der Gemüsetabelle die Werte in allen Spalten VEGETABLE_ID, VEGETABLE_NAME , und Farbe wurden kopiert.

Wir können das nutzen rowid , ein Locator, der angibt, wo Oracle die Zeile speichert. Weil das rowid ist eindeutig, sodass wir damit die doppelten Zeilen entfernen können.

 DELETE FROM Vegetables WHERE rowed NOT IN ( SELECT MIN(rowid) FROM vegetables GROUP BY VEGETABLE_ID, VEGETABLE_NAME, color );

Die Abfrage überprüft den Löschvorgang:

 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	FARBE
01	Kartoffel	Braun
02	Zwiebel	Rot
03	Kürbis	Grün
04	Kürbis	Gelb

TechCodeview

Wie lösche ich doppelte Zeilen in SQL?