HOW TO DELETE DUPLICATE ROWS IN SQL

In this section, we look at different ways to delete duplicate rows in MySQL and Oracle. If a SQL table contains duplicate rows, we need to remove them.


Preparing sample data

The following script creates a table named contacts.

 DROP TABLE IF EXISTS contacts;
 CREATE TABLE contacts (
     id INT PRIMARY KEY AUTO_INCREMENT,
     first_name VARCHAR(30) NOT NULL,
     last_name VARCHAR(25) NOT NULL,
     email VARCHAR(210) NOT NULL,
     age VARCHAR(22) NOT NULL
 );

We insert the following data into the table.

 INSERT INTO contacts (first_name, last_name, email, age) VALUES
     ('Kavin','Peterson','[email protected]','21'),
     ('Nick','Jonas','[email protected]','18'),
     ('Peter','Heaven','[email protected]','23'),
     ('Michal','Jackson','[email protected]','22'),
     ('Sean','Bean','[email protected]','23'),
     ('Tom','Baker','[email protected]','20'),
     ('Ben','Barnes','[email protected]','17'),
     ('Mischa','Barton','[email protected]','18'),
     ('Sean','Bean','[email protected]','16'),
     ('Eliza','Bennett','[email protected]','25'),
     ('Michal','Krane','[email protected]','25'),
     ('Peter','Heaven','[email protected]','20'),
     ('Brian','Blessed','[email protected]','20'),
     ('Kavin','Peterson','[email protected]','30');

We run this script again to recreate the test data after executing a DELETE statement.

The following query returns the data from the contacts table:

 SELECT * FROM contacts ORDER BY email;

id	first_name	last_name	email	age
7	Ben	Barnes	[email protected]	21
13	Brian	Blessed	[email protected]	18
10	Eliza	Bennett	[email protected]	23
1	Kavin	Peterson	[email protected]	22
14	Kavin	Peterson	[email protected]	23
8	Mischa	Barton	[email protected]	20
11	Michal	Krane	[email protected]	17
4	Michal	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
3	Peter	Heaven	[email protected]	25
12	Peter	Heaven	[email protected]	25
5	Sean	Bean	[email protected]	20
9	Sean	Bean	[email protected]	20
6	Tom	Baker	[email protected]	30

The following SQL query returns the duplicate emails from the contacts table:

 SELECT email, COUNT(email) FROM contacts GROUP BY email HAVING COUNT(email) > 1;

email	COUNT(email)
[email protected]	2
[email protected]	2
[email protected]	2

Three email addresses each appear in two rows.
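The grouped query reports only the email values and their counts. To inspect the complete rows behind each duplicate, the grouped result can be joined back to the table. The following is a minimal sketch in MySQL syntax; the aliases c and d are illustrative names, not part of the original script.

```sql
-- List every row whose email occurs more than once.
SELECT c.id, c.first_name, c.last_name, c.email
FROM contacts c
INNER JOIN (
    -- the duplicate-email query from above, used as a derived table
    SELECT email
    FROM contacts
    GROUP BY email
    HAVING COUNT(email) > 1
) d ON d.email = c.email
ORDER BY c.email, c.id;
```

Sorting by email and then by id places the copies of each email next to each other, which makes it easier to decide which copy to keep.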


(A) Delete duplicate rows using the DELETE JOIN statement

 DELETE t1 FROM contacts t1 INNER JOIN contacts t2 WHERE t1.id < t2.id AND t1.email = t2.email;

Output:

 Query OK, 3 rows affected (0.10 sec)

Three rows have been deleted. We execute the query below to look for duplicate emails in the table again.

 SELECT email, COUNT(email) FROM contacts GROUP BY email HAVING COUNT(email) > 1;

The query returns an empty set. To verify the data in the contacts table, execute the following SQL query:

 SELECT * FROM contacts;

id	first_name	last_name	email	age
7	Ben	Barnes	[email protected]	21
13	Brian	Blessed	[email protected]	18
10	Eliza	Bennett	[email protected]	23
14	Kavin	Peterson	[email protected]	23
8	Mischa	Barton	[email protected]	20
11	Michal	Krane	[email protected]	17
4	Michal	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
12	Peter	Heaven	[email protected]	25
9	Sean	Bean	[email protected]	20
6	Tom	Baker	[email protected]	30

Because the condition is t1.id < t2.id, the statement removes the row with the lower id in each pair of duplicates. The rows with ids 1, 3, and 5 have been deleted, and the row with the highest id is kept for each email. To keep the row with the lowest id instead, use the statement below.

First run the script that creates the contacts table again, then execute:

 DELETE c1 FROM contacts c1 INNER JOIN contacts c2 WHERE c1.id > c2.id AND c1.email = c2.email;

This time the rows with ids 9, 12, and 14 are deleted, and the table contains:

id	first_name	last_name	email	age
7	Ben	Barnes	[email protected]	21
13	Brian	Blessed	[email protected]	18
10	Eliza	Bennett	[email protected]	23
1	Kavin	Peterson	[email protected]	22
8	Mischa	Barton	[email protected]	20
11	Michal	Krane	[email protected]	17
4	Michal	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
3	Peter	Heaven	[email protected]	25
5	Sean	Bean	[email protected]	20
6	Tom	Baker	[email protected]	30
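Before running a DELETE JOIN, it can help to preview which rows it would remove. The sketch below is a suggested check rather than part of the original steps. It reuses the join condition of the statement that keeps the lowest id, written as a SELECT in MySQL syntax.

```sql
-- Preview the rows that the "keep the lowest id" DELETE JOIN would remove.
-- DISTINCT avoids listing a row more than once when an email occurs
-- three or more times.
SELECT DISTINCT c1.id, c1.first_name, c1.last_name, c1.email
FROM contacts c1
INNER JOIN contacts c2
    ON c1.email = c2.email
   AND c1.id > c2.id
ORDER BY c1.id;
```

On freshly created sample data this should list the same rows that the DELETE statement later removes. Changing the comparison to c1.id < c2.id previews the variant that keeps the highest id.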

(B) Delete duplicate rows using an intermediate table

To delete duplicate rows by using an intermediate table, follow the steps given below:

Step 1. Create a new table with the same structure as the original table:

 CREATE TABLE source_copy LIKE source;

Step 2. Insert the distinct rows from the original table into the new table:

 INSERT INTO source_copy SELECT * FROM source GROUP BY col;

Step 3. Drop the original table and rename the intermediate table to the original name.


 DROP TABLE source; ALTER TABLE source_copy RENAME TO source;

For example, the following statements delete the rows with duplicate emails from the contacts table:

 -- step 1
 CREATE TABLE contacts_temp LIKE contacts;
 -- step 2
 INSERT INTO contacts_temp SELECT * FROM contacts GROUP BY email;
 -- step 3
 DROP TABLE contacts;
 ALTER TABLE contacts_temp RENAME TO contacts;
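One caveat about step 2: since MySQL 5.7.5 the ONLY_FULL_GROUP_BY SQL mode is enabled by default, and under that mode SELECT * combined with GROUP BY email is rejected, because the other columns are neither aggregated nor determined by email. The following is a minimal sketch of an alternative for steps 1 and 2 that should work with that mode enabled. It assumes we want to keep the row with the lowest id for each email; the alias k and the column name keep_id are illustrative.

```sql
-- step 1: empty copy with the same structure
CREATE TABLE contacts_temp LIKE contacts;

-- step 2: copy exactly one row per email (the one with the lowest id)
INSERT INTO contacts_temp
SELECT c.*
FROM contacts c
INNER JOIN (
    SELECT MIN(id) AS keep_id
    FROM contacts
    GROUP BY email
) k ON k.keep_id = c.id;
```

Step 3 (dropping the original table and renaming contacts_temp) stays the same. Using MAX(id) in place of MIN(id) keeps the most recently inserted copy instead.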

(C) Delete duplicate rows using the ROW_NUMBER() function

Note: The ROW_NUMBER() function has been supported since MySQL version 8.0.2, so we should check our MySQL version before using it.

The following statement uses ROW_NUMBER() to assign a sequential integer to every row within each group of equal emails, numbering the rows in id order. If the email is a duplicate, the row number will be higher than one.

 SELECT id, email, ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num FROM contacts;

The following SQL query returns the list of ids of the duplicate rows:

 SELECT id FROM (SELECT id, ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num FROM contacts) t WHERE row_num > 1;

Output:

id
9
12
14
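The query above only lists the ids of the duplicate rows. To actually remove them, the same subquery can feed a DELETE statement. The following is a minimal sketch for MySQL 8.0 or later; it keeps the row with the lowest id for each email, which is an assumption about the row we want to preserve.

```sql
-- Delete every row that is not the first (lowest id) row of its email group.
DELETE FROM contacts
WHERE id IN (
    SELECT id
    FROM (
        SELECT id,
               ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
        FROM contacts
    ) t
    WHERE row_num > 1
);
```

The inner SELECT is wrapped in the derived table t because MySQL does not allow a DELETE to read directly from the table it is modifying inside a subquery.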


Delete duplicate records in Oracle

When we find duplicate records in a table, we have to delete the unwanted copies to keep our data clean and unique. If a table has duplicate rows, we can delete them by using the DELETE statement.

In the first case, the table has a column that is not part of the group of columns used to identify duplicate records.

Consider the table given below:

VEGETABLE_ID	VEGETABLE_NAME	COLOR
01	Potato	Brown
02	Potato	Brown
03	Onion	Red
04	Onion	Red
05	Onion	Red
06	Pumpkin	Green
07	Pumpkin	Yellow

 -- create the vegetable table
 CREATE TABLE vegetables (
     VEGETABLE_ID NUMBER GENERATED BY DEFAULT AS IDENTITY,
     VEGETABLE_NAME VARCHAR2(100),
     color VARCHAR2(20),
     PRIMARY KEY (VEGETABLE_ID)
 );

 -- insert sample rows
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Potato','Brown');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Potato','Brown');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Onion','Red');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Onion','Red');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Onion','Red');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Pumpkin','Green');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Pumpkin','Yellow');

 -- query data from the vegetable table
 SELECT * FROM vegetables;

Suppose we want to keep the row with the highest VEGETABLE_ID and delete all other copies. The following query returns the highest VEGETABLE_ID in each group of identical name and color:

 SELECT MAX(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color ORDER BY MAX(VEGETABLE_ID);

MAX(VEGETABLE_ID)
2
5
6
7

We use the DELETE statement to delete the rows whose VEGETABLE_ID value is not the highest in its group.

 DELETE FROM vegetables WHERE VEGETABLE_ID NOT IN ( SELECT MAX(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color );

Three rows have been deleted.

 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	COLOR
02	Potato	Brown
05	Onion	Red
06	Pumpkin	Green
07	Pumpkin	Yellow

If we want to keep the row with the lowest id, we use the MIN() function instead of the MAX() function.

 DELETE FROM vegetables WHERE VEGETABLE_ID NOT IN ( SELECT MIN(VEGETABLE_ID) FROM vegetables GROUP BY VEGETABLE_NAME, color );

The above method works if we have a column that is not part of the group used to identify duplicates. If the values in all columns are duplicated, we cannot use the VEGETABLE_ID column.

Let's drop the vegetables table and create it again with a new structure.

 DROP TABLE vegetables;
 CREATE TABLE vegetables (
     VEGETABLE_ID NUMBER,
     VEGETABLE_NAME VARCHAR2(100),
     color VARCHAR2(20)
 );

 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (1,'Potato','Brown');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (1,'Potato','Brown');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (2,'Onion','Red');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (2,'Onion','Red');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (2,'Onion','Red');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (3,'Pumpkin','Green');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (4,'Pumpkin','Yellow');
 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	COLOR
01	Potato	Brown
01	Potato	Brown
02	Onion	Red
02	Onion	Red
02	Onion	Red
03	Pumpkin	Green
04	Pumpkin	Yellow

In the vegetables table, the duplicate rows now match in every column: VEGETABLE_ID, VEGETABLE_NAME, and color.


We can use the rowid, a locator that specifies where Oracle stores the row. Because the rowid is unique for each row, we can use it to remove the duplicate rows.

 DELETE FROM vegetables WHERE rowid NOT IN ( SELECT MIN(rowid) FROM vegetables GROUP BY VEGETABLE_ID, VEGETABLE_NAME, color );

The following query verifies the delete operation:

 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	COLOR
01	Potato	Brown
02	Onion	Red
03	Pumpkin	Green
04	Pumpkin	Yellow
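The rowid can also be combined with ROW_NUMBER() in Oracle, in the same spirit as the MySQL approach in part (C). The following is a minimal sketch of that alternative, intended to be run on the table while it still contains the duplicates. The aliases rid and rn are illustrative names, not part of the original example.

```sql
-- Number the rows inside each group of fully identical values,
-- then delete every row except the first one of its group.
DELETE FROM vegetables
WHERE rowid IN (
    SELECT rid
    FROM (
        SELECT rowid AS rid,
               ROW_NUMBER() OVER (
                   PARTITION BY VEGETABLE_ID, VEGETABLE_NAME, color
                   ORDER BY rowid
               ) AS rn
        FROM vegetables
    )
    WHERE rn > 1
);
```

Both variants should leave one row per group. This form makes it straightforward to choose a different surviving row by changing the ORDER BY inside the window.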
