Are mixed pages removed by an index rebuild?重建索引会回收混合区的页面吗
Are mixed pages removed by an index rebuild?重建索引会回收混合区的页面吗
http://www.sqlskills.com/blogs/paul/mixed-pages-removed-index-rebuild/
This is a question that came up this morning during our IE1 class that I thought would make an interesting blog post as there are some twists to the answer.
The first 8 pages that are allocated to an allocation unit are mixed pages from mixed extents, unless trace flag 1118 is enabled.
See the following blog posts for more info:
- Inside the Storage Engine: IAM pages, IAM chains, and allocation units
- Inside the Storage Engine: Anatomy of an extent
- Misconceptions around TF 1118
Assuming that mixed pages are not disabled with trace flag 1118, does an index rebuild remove all mixed pages or not?
Let’s investigate. First I’ll create a clustered index with 1,000 data pages:
1
2
3
4
5
6
|
CREATE TABLE [MixedTest] ([c1] BIGINT IDENTITY, [c2] CHAR (8000) DEFAULT 'a' ); CREATE CLUSTERED INDEX [MixedTest_CL] ON [MixedTest] ([c1]); SET NOCOUNT ON ; GO INSERT INTO [MixedTest] DEFAULT VALUES ; GO 1000 |
And then make sure that we have mixed pages be examining the first IAM page in the clustered index’s IAM chain. You can get the sp_AllocationMetadata prochere.
1
2
|
EXEC [sp_AllocationMetadata] N 'MixedTest' ; GO |
Object Name Index ID Alloc Unit ID Alloc Unit Type First Page Root Page First IAM Page ------------ --------- ------------------ ---------------- ----------- ---------- --------------- MixedTest 1 72057594046185472 IN_ROW_DATA (1:987) (1:1732) (1:988) |
1
2
3
|
DBCC TRACEON (3604); DBCC PAGE (N 'master' , 1, 988, 3); GO |
(I’m just including the relevant portion of the DBCC PAGE output here…)
<snip> IAM: Single Page Allocations @0x00000000227EA08E Slot 0 = (1:987) Slot 1 = (1:989) Slot 2 = (1:990) Slot 3 = (1:991) Slot 4 = (1:1816) Slot 5 = (1:1817) Slot 6 = (1:1818) Slot 7 = (1:1819) <snip> |
Now I’ll do an offline index rebuild of the clustered index, and look again at the IAM page contents (assume I’m running the proc and DBCC PAGE after the rebuild):
1
2
|
ALTER INDEX [MixedTest_CL] ON [MixedTest] REBUILD; GO |
<snip> IAM: Single Page Allocations @0x0000000023B0A08E Slot 0 = (1:1820) Slot 1 = (1:446) Slot 2 = (1:1032) Slot 3 = (0:0) Slot 4 = (1:1035) Slot 5 = (1:1034) Slot 6 = (1:1037) Slot 7 = (1:1036) <snip> |
So the answer is no, an index rebuild does not remove mixed page allocations. Only trace flag 1118 does that.
But this is interesting – there are only 7 mixed pages in the singe-page slot array above. What happened? The answer is that the offline index rebuild ran in parallel, with each thread building a partial index, and then these are stitched together. The ‘stitching together’ operation will cause some of the non-leaf index pages to be deallocated as their contents are merged together. This explains the deallocated page that was originally tracked by entry 3 in the slot array.
Let’s try an offline index rebuild that forces a serial plan.
1
2
|
ALTER INDEX [MixedTest_CL] ON [MixedTest] REBUILD WITH (MAXDOP = 1); GO |
<snip> IAM: Single Page Allocations @0x0000000023B0A08E Slot 0 = (1:1822) Slot 1 = (1:1823) Slot 2 = (1:291) Slot 3 = (1:292) Slot 4 = (0:0) Slot 5 = (0:0) Slot 6 = (0:0) Slot 7 = (0:0) <snip> |
In this case there is only one index (i.e. no parallel mini indexes) being built so there are no pages being deallocated in the new index as there is no stitching operation. But why aren’t there 8 mixed pages? This is because during the build phase of the new index, the leaf-level pages are taken from bulk-allocated dedicated extents, regardless of the recovery model in use. The mixed pages are non-leaf index pages (which you can prove to yourself using DBCC PAGE).
For parallel and single-threaded online index operations, the same two patterns occur as for offline index rebuilds, even though the algorithm is slightly different.
Enjoy!
在线重建索引
ALTER INDEX [IX_ClassifyResult2] ON [dbo].[ClassifyResult2] REBUILD WITH (FILLFACTOR = 80, SORT_IN_TEMPDB = ON,ONLINE=ON);
参考文献:
http://technet.microsoft.com/en-us/library/ms188388.aspx
正文
本文主要讲解如何使用alter index来rebuild和reorganize索引来清除碎片,rebuild能够完全清除碎片,但是reorganize却不能。
Rebuild index
--1.准备实验数据 select * into Employee from AdventureWorks2008R2.HumanResources.Employee; --2.查看使用空间:Employee 290 72 KB 56 KB 8 KB 8 KB sp_spaceused Employee --3.创建聚集索引 create clustered index IX_BusinessEntityID on Employee(BusinessEntityID); --4.查看使用空间:Employee 290 80 KB 56 KB 16 KB 8 KB sp_spaceused Employee --5.索引重建,清除fragment,并设定fillfactor为60 ALTER INDEX ALL ON Employee REBUILD WITH (FILLFACTOR = 60, SORT_IN_TEMPDB = ON, STATISTICS_NORECOMPUTE = ON); --6.查看使用空间:Employee 290 144 KB 88 KB 16 KB 40 KB sp_spaceused Employee
结论
- 在创建索引以后,index从8变到16,说明索引占用物理磁盘,因此索引是一个physical object。
- 在重建索引并设定fillfactor为60以后,我们发现data空间变大,这是因为填充因子重新使得原来装满的data page现在只装60%
- fillfactor只在对已有数据create index和alter index rebuild的时候有用。对于普通的insert操作无效。
- fillfactor的取值是1-100,msnd上说取0跟取100一样,但是实际测试发现使用0报错。
reorganize index
ALTER INDEX ALL ON Employee REORGANIZE GO
两者的区别
Rebuilding an index drops and re-creates the index. This removes fragmentation, reclaims disk space by compacting the pages based on the specified or existing fill factor setting, and reorders the index rows in contiguous pages. When ALL is specified, all indexes on the table are dropped and rebuilt in a single transaction.
重新生成索引将会删除并重新创建索引。 这将根据指定的或现有的填充因子设置压缩页来删除碎片、回收磁盘空间,然后对连续页中的索引行重新排序。 如果指定 ALL,将删除表中的所有索引,然后在单个事务中重新生成。
Reorganizing an index uses minimal system resources. It defragments the leaf level of clustered and nonclustered indexes on tables and views by physically reordering the leaf-level pages to match the logical, left to right, order of the leaf nodes. Reorganizing also compacts the index pages. Compaction is based on the existing fill factor value.
重新组织索引使用最少系统资源重新组织索引。 通过对叶级页以物理方式重新排序,使之与叶节点的从左到右的逻辑顺序相匹配,进而对表和视图中的聚集索引和非聚集索引的叶级进行碎片整理。 重新组织还会压缩索引页。 压缩基于现有的填充因子值。
Rebuilding an index can be executed online or offline. Reorganizing an index is always executed online. To achieve availability similar to the reorganize option, you should rebuild indexes online.
rebulid index既可以在online又可以在offline下执行,而reorganize index只能在online下执行的。
Difference between rebuild index online and offline(PS:2012-9-11)
既然rebuild index既可以是online模式,也可以是offline模式,那么两者有什么区别呢。这个我们可以参考stackoverflow上面的一篇文章:What is the difference between OFFLINE and ONLINE index rebuild in SQL Server? 在这里我还是简要总结一下:
online模式下
rebuild index会复制旧索引来新建索引,此时旧的索引依然可以被读取和修改,但是所以在旧索引上的修改都会同步更新到新索引下。中间会有一些冲突解决机制,具体参考Online Index Operations 里面的Build Phase这一章节。然后在rebuild这个过程完整的时候,会对table上锁一段时间,在这段时间里会用新索引来替换旧索引,当这个过程完成以后再释放table上面的锁。如果索引列包含 LOB对象的话,在SQL Server 2005/2008/R2中rebuild index online会失败。在sql server 2012中,即使索引列包含LOB对象,也可以rebuild index online了,可以参考 Online Index Operations for indexes containing LOB columns.
offline模式下
rebuilde index会对table上锁,所有对这个table的读写操作都会被阻塞,在这期间新索引根据旧索引来创建,其实就是一个复制的过程,但是新索引没有碎片,最后使用新索引替换旧索引。当rebuild整个过程完成以后,table上面的锁才会被释放。