
Clustered index disadvantages in MySQL

If you don’t define a primary key, InnoDB will try to use a unique nonnullable index
instead. If there’s no such index, InnoDB will define a hidden primary key for you
and then cluster on that. InnoDB clusters records together only within a page. Pages
with adjacent key values may be distant from each other.
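
The selection rule above can be sketched as a small function. This is only a toy model of the decision, not InnoDB code: the `Index` class, the `choose_clustering_key` function, and the index names are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass
class Index:
    name: str
    columns: List[str]
    unique: bool = False
    primary: bool = False


def choose_clustering_key(indexes: List[Index], nullable_columns: Set[str]) -> str:
    """Return the name of the index a table would be clustered on (toy model)."""
    # 1. An explicitly defined primary key always wins.
    for idx in indexes:
        if idx.primary:
            return idx.name
    # 2. Otherwise, the first unique index whose columns are all NOT NULL.
    for idx in indexes:
        if idx.unique and not any(col in nullable_columns for col in idx.columns):
            return idx.name
    # 3. Otherwise, a hidden key generated by the storage engine.
    return "<hidden row ID>"


if __name__ == "__main__":
    indexes = [
        Index("uk_email", ["email"], unique=True),  # email is nullable here
        Index("uk_code", ["code"], unique=True),    # code is NOT NULL
    ]
    print(choose_clustering_key(indexes, nullable_columns={"email"}))
```

In this example the unique index on the nullable column is skipped, so the table would be clustered on `uk_code`.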

The benefits of clustering can boost performance tremendously if you design your
tables and queries to take advantage of them. However, clustered indexes also have
disadvantages:
• Clustering gives the largest improvement for I/O-bound workloads. If the data
fits in memory, the order in which it’s accessed doesn’t really matter, so clustering
doesn’t give much benefit.
• Insert speeds depend heavily on insertion order. Inserting rows in primary key
order is the fastest way to load data into an InnoDB table. It may be a good idea
to reorganize the table with OPTIMIZE TABLE after loading a lot of data if you
didn’t load the rows in primary key order.
• Updating the clustered index columns is expensive, because it forces InnoDB to
move each updated row to a new location.
• Tables built upon clustered indexes are subject to page splits when new rows are
inserted, or when a row’s primary key is updated such that the row must be
moved. A page split happens when a row’s key value dictates that the row must
be placed into a page that is full of data. The storage engine must split the page
into two to accommodate the row. Page splits can cause a table to use more
space on disk.
• Clustered tables can be slower for full table scans, especially if rows are less
densely packed or stored nonsequentially because of page splits.
• Secondary (nonclustered) indexes can be larger than you might expect, because
their leaf nodes contain the primary key columns of the referenced rows.
• Secondary index accesses require two index lookups instead of one.
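
The points about insertion order and page splits can be illustrated with a toy simulation. It is a sketch under simplifying assumptions, not a model of InnoDB internals: a page holds a fixed number of keys (`PAGE_CAPACITY`, a hypothetical value), a full page is split exactly in half, and an insert past the end of the table starts a fresh page instead of splitting. The names `load` and `fill_factor` are made up for this example.

```python
import bisect
import random

PAGE_CAPACITY = 4  # hypothetical number of rows that fit in one page (must be >= 2)


def load(keys, capacity=PAGE_CAPACITY):
    """Insert keys one by one; return (pages, number_of_page_splits)."""
    pages = []   # pages kept in key order; each page is a sorted list of keys
    splits = 0
    for key in keys:
        if not pages:
            pages.append([key])
            continue
        # Pick the last page whose smallest key is <= key (or the first page).
        lows = [page[0] for page in pages]
        i = max(bisect.bisect_right(lows, key) - 1, 0)
        page = pages[i]
        if len(page) < capacity:
            bisect.insort(page, key)
        elif i == len(pages) - 1 and key > page[-1]:
            # Appending past the end of the table: start a fresh page, no split.
            pages.append([key])
        else:
            # The target page is full: split it in half, then insert the key.
            mid = len(page) // 2
            left, right = page[:mid], page[mid:]
            pages[i:i + 1] = [left, right]
            splits += 1
            bisect.insort(right if key >= right[0] else left, key)
    return pages, splits


def fill_factor(pages, capacity=PAGE_CAPACITY):
    """Fraction of the allocated page space that actually holds keys."""
    return sum(len(p) for p in pages) / (len(pages) * capacity)


if __name__ == "__main__":
    keys = list(range(1, 1001))
    shuffled = keys[:]
    random.Random(0).shuffle(shuffled)
    for label, order in (("primary key order", keys), ("random order", shuffled)):
        pages, splits = load(order)
        print(f"{label}: {len(pages)} pages, {splits} splits, "
              f"fill factor {fill_factor(pages):.2f}")
```

In this model, loading in key order never splits a page and leaves every page full, while loading in random order triggers splits and leaves pages partly empty, so the same rows occupy more pages. Reloading the shuffled keys in sorted order (`load(sorted(shuffled))`) packs them densely again, which is roughly the effect the text attributes to reorganizing the table after an out-of-order load.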
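The last point in the list can be a bit confusing. Why would a secondary index require
two index lookups? The answer lies in the nature of the “row pointers” the secondary
index stores. Remember, a leaf node doesn’t store a pointer to the referenced row’s
physical location; rather, it stores the row’s primary key values. To find a row
through a secondary index, the storage engine first finds the leaf node in the
secondary index, and then uses the primary key values stored there to search the
clustered index for the row itself.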
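
A minimal sketch of the two lookups, using plain dictionaries as stand-ins for the two B-trees. The table contents, the `email` column, and the function names are hypothetical; the only point is that the secondary structure maps an indexed value to a primary key, not to the row.

```python
# Clustered index: primary key -> full row (the row lives in the index itself).
clustered = {
    1: {"id": 1, "email": "alice@example.com", "name": "Alice"},
    2: {"id": 2, "email": "bob@example.com", "name": "Bob"},
}

# Secondary index on email: indexed value -> primary key value, not a row location.
secondary_by_email = {
    "alice@example.com": 1,
    "bob@example.com": 2,
}


def find_by_pk(pk):
    """Return (row, number_of_index_lookups) for a primary key search."""
    return clustered.get(pk), 1


def find_by_email(email):
    """Return (row, number_of_index_lookups) for a secondary index search."""
    pk = secondary_by_email.get(email)   # lookup 1: secondary index
    if pk is None:
        return None, 1
    return clustered.get(pk), 2          # lookup 2: clustered index


if __name__ == "__main__":
    print(find_by_pk(2))
    print(find_by_email("bob@example.com"))
```

Both calls return the same row, but the search by `email` has to traverse two structures. The sketch also shows why secondary indexes grow with the size of the primary key: every entry carries a copy of it.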

Posted on Keep Walking