MySQL 数据库中删除重复数据的方法

 演示数据,仅供参考

查询表结构:

mysql> desc test;
+-------+------------------+------+-----+---------+----------------+
| Field | Type             | Null | Key | Default | Extra          |
+-------+------------------+------+-----+---------+----------------+
| id    | int(11) unsigned | NO   | PRI | NULL    | auto_increment |
| site  | varchar(100)     | NO   | MUL |         |                |
+-------+------------------+------+-----+---------+----------------+
2 rows in set (0.00 sec)

查询数据:

复制代码
mysql> select * from test order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+|  1 | http://www.baidu.com  |
|  2 | http://www.hao123.com        |
|  3 | http://www.huwei.com |
|  4 | http://www.baidu.com  |
|  5 | http://www.huwei.com |
+----+------------------------+
5 rows in set (0.00 sec)
复制代码

当没有创建表或创建索引权限的时候,如果你要删除较旧的重复记录,可以使用下面的语句:

复制代码
mysql> delete from a
    -> using test as a, test as b
    -> where (a.id > b.id)
    -> and (a.site = b.site);
Query OK, 2 rows affected (0.12 sec)

mysql> select * from test order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+
|  1 | http://www.baidu.com  |
|  2 | http://www.hao123.com        |
|  3 | http://www.huwei.com |
+----+------------------------+
3 rows in set (0.00 sec)
复制代码

 

如果你要删除较新的重复记录,可以使用下面的语句:

复制代码
mysql> delete from a
    -> using test as a, test as b
    -> where (a.id < b.id)
    -> and (a.site = b.site);
Query OK, 2 rows affected (0.12 sec)

mysql> select * from test order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+
|  2 | http://www.hao123.com        |
|  4 | http://www.baidu.com  |
|  5 | http://www.huwei.com |
+----+------------------------+
3 rows in set (0.00 sec)
复制代码

你可以用下面的语句先确认将被删除的重复记录:

复制代码
mysql> SELECT a.*
    -> FROM test a, test b
    -> WHERE a.id > b.id
    -> AND (a.site = b.site);
+----+------------------------+
| id | site                   |
+----+------------------------+
|  1 | http://www.baidu.com  |
|  3 | http://www.huwei.com |
+----+------------------------+
2 rows in set (0.00 sec)
复制代码

如果有创建索引的权限,在表上创建唯一键索引,可以用下面的方法:

复制代码
mysql> alter ignore table test add unique index ukey (site);
Query OK, 5 rows affected (0.46 sec)
Records: 5  Duplicates: 2  Warnings: 0

mysql> select * from test order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+
|  1 | http://www.baidu.com  |
|  2 | http://www.hao123.com        |
|  3 | http://www.huwei.com |
+----+------------------------+
3 rows in set (0.00 sec)
复制代码

重复记录被删除后,如果需要,可以删除索引:

mysql> alter table test drop index ukey;
Query OK, 3 rows affected (0.37 sec)
Records: 3  Duplicates: 0  Warnings: 0

如果有创建表的权限,创建一个新表,然后将原表中不重复的数据插入新表:

复制代码
mysql> create table test_new as select * from test group by site;
Query OK, 3 rows affected (0.19 sec)
Records: 3  Duplicates: 0  Warnings: 0

mysql> show tables;
+----------------+
| Tables_in_test |
+----------------+
| test           |
| test_new       |
+----------------+
2 rows in set (0.00 sec)

mysql> select * from test order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+
|  1 | http://www.baidu.com  |
|  2 | http://www.hao123.com        |
|  3 | http://www.huwei.com |
|  4 | http://www.baidu.com  |
|  5 | http://www.huwei.com |
+----+------------------------+
5 rows in set (0.00 sec)

mysql> select * from test_new order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+
|  1 | http://www.baidu.com  |
|  2 | http://www.hao123.com        |
|  3 | http://www.huwei.com |
+----+------------------------+
3 rows in set (0.00 sec)
复制代码

然后将原表备份,将新表重命名为当前表:

复制代码
mysql> rename table test to test_old, test_new to test;
Query OK, 0 rows affected (0.04 sec)

mysql> show tables;
+----------------+
| Tables_in_test |
+----------------+
| test           |
| test_old       |
+----------------+
2 rows in set (0.00 sec)

mysql> select * from test order by id;
+----+------------------------+
| id | site                   |
+----+------------------------+
|  1 | http://www.baidu.com  |
|  2 | http://www.hao123.com        |
|  3 | http://www.huwei.com |
+----+------------------------+
3 rows in set (0.00 sec)
复制代码

注意:使用这种方式创建的表会丢失原表的索引信息!

mysql> desc test;
+-------+------------------+------+-----+---------+-------+
| Field | Type             | Null | Key | Default | Extra |
+-------+------------------+------+-----+---------+-------+
| id    | int(11) unsigned | NO   |     | 0       |       |
| site  | varchar(100)     | NO   |     |         |       |
+-------+------------------+------+-----+---------+-------+
2 rows in set (0.00 sec)

 

posted @   牵只蜗牛去散步  阅读(3895)  评论(0编辑  收藏  举报
编辑推荐:
· 10年+ .NET Coder 心语,封装的思维:从隐藏、稳定开始理解其本质意义
· .NET Core 中如何实现缓存的预热?
· 从 HTTP 原因短语缺失研究 HTTP/2 和 HTTP/3 的设计差异
· AI与.NET技术实操系列:向量存储与相似性搜索在 .NET 中的实现
· 基于Microsoft.Extensions.AI核心库实现RAG应用
阅读排行:
· TypeScript + Deepseek 打造卜卦网站:技术与玄学的结合
· 阿里巴巴 QwQ-32B真的超越了 DeepSeek R-1吗?
· 【译】Visual Studio 中新的强大生产力特性
· 10年+ .NET Coder 心语 ── 封装的思维:从隐藏、稳定开始理解其本质意义
· 【设计模式】告别冗长if-else语句:使用策略模式优化代码结构
历史上的今天:
2018-08-06 银行卡信用卡的支付测试
点击右上角即可分享
微信分享提示