DBCC WRITEPAGE: an introduction
DBCC WRITEPAGE: an introduction
http://www.sqlskills.com/blogs/paul/dbcc-writepage/
Check out my online training courses: SQL Server: Detecting and Correcting Database Corruption and SQL Server: Advanced Corruption Recovery Techniques. We can also help you with disaster recovery.)
SQL Server’s undocumented commands, options, and trace flags are sometimes carefully-guarded secrets. I’ve been a major secret-keeper of lots of these, having written/re-written a bunch of the undocumented DBCC commands. I’ve been assiduously avoiding writing this blog post for 14 years but now my hand has been forced.
There’s one command in particular, DBCC WRITEPAGE, that has been effectively hidden since it was introduced pre-SQL Server 2000. However, more and more people are learning about it since some of the exam training books publicized trace flag 2588 that allows DBCC HELP (‘?’) to show all the undocumented DBCC commands, and their syntax (it used to be trace flag 2520, but I changed it for SQL Server 2005 to help obfuscate access to the undocumented DBCCs).
Some people are even starting to blog and present about the more obscure commands, including DBCC WRITEPAGE. I asked, in vain, for DBCC WRITEPAGE to not be publicized so I now feel compelled to blog about it so there is authoritative information available about it (I rewrote and extended its functionality for SQL Server 2005). I discussed this issue with the SQL product team and CSS last week, and they agree with me doing this. You won’t see a Microsoft post about this, but my great friend Bob Ward from CSS wanted me to quote him saying “Microsoft does not support the usage of DBCC WRITEPAGE and its use would invalidate the support for a customer database on which it is used.”
I present to you DBCC WRITEPAGE – the most dangerous command you can use in SQL Server.
*** DISCLAIMER: The information in this blog post should not be used on a production SQL Server system. Any problem, corruption, damage, or loss you cause by using the information presented here is entirely your own responsibility. Use at your own risk. Danger of death. ***
Well, not danger of death :-)
DBCC WRITEPAGE allows you alter any byte on any page in any database, as long as you have sysadmin privileges. It also allows you to completely circumvent the buffer pool, in other words you can force page checksum failures – this was the part I wrote from scratch, and I’m rather proud of its sneakiness.
The purposes of DBCC WRITEPAGE are:
- To allow automated testing of DBCC CHECKDB and repair by the SQL Server team.
- To engineer corruptions for demos and testing.
- To allow for last-ditch disaster recovery by manually editing a live, corrupt database.
I use it for #2 and #3 (and obviously used to do #1). I most recently used it on a live client system in October 2012 after a massive, multi-site SAN failure trashed databases and backups. I do not advise that you attempt #3 unless you’re confident you know what you’re doing and the side-effects on the Storage Engine from the byte(s) that you’re changing.
It is a very dangerous command because:
- It is an entirely physical change to the page – nothing is logged in the transaction log, and it cannot be rolled back.
- You can change any page in any database. For instance, you could use it to modify a page in master so that the instance immediately shuts down and will not start until master is rebuilt and restored.
- Anyone with sysadmin privileges can use it and there is no way to stop it.
- It breaks the support of your database.
You can very easily shoot yourself in the foot very badly playing around with this command. This isn’t hyperbole – it’s just the truth.
I’m going to do a series of posts showing the use of this command – as there are some cool things you can do with it for demo purposes.
(Toolbars to easily copy and paste code have been deliberately disabled in this post.)
The syntax is:
1
2
3
4
|
DBCC TRACEON (2588); GO DBCC HELP ( 'WRITEPAGE' ); GO |
dbcc WRITEPAGE ({'dbname' | dbid}, fileid, pageid, offset, length, data [, directORbufferpool]) |
The parameters mean:
- ‘dbname’ | dbid : self-explanatory
- fileid : file ID containing the page to change
- pageid : zero-based page number within that file
- offset : zero-based offset in bytes from the start of the page
- length : number of bytes to change, from 1 to 8
- data : the new data to insert (in hex, in the form ’0xAABBCC’ – example three-byte string)
- directORbufferpool : whether to bypass the buffer pool or not (0/1)
I’m actually a bit dismayed that someone added the final option to the syntax print-out, as it’s the most dangerous option – and the part I added in SQL Server 2005. If you specify ’1′ for the directORbufferpool parameter, DBCC WRITEPAGE does the following:
- Checkpoints the database and flushes all its pages out of the buffer pool
- Unhooks SQL Server’s FCB (File Control Block) from the data file
- Creates its own FCB for the file
- Does a direct read of the page into DBCC’s memory
- Modifies the page directly
- Writes the page directly to disk, bypassing the buffer pool and any page protection generation (i.e. not recalculating the page checksum)
- Fixes up the FCBs again
So the buffer pool knows nothing about the change to the page – it’s as if the I/O subsystem changed/corrupted the page. If you don’t specify that parameter, or specify ’0′, the change occurs on the page in the buffer pool and the page checksum will be calculated correctly when the page is written to disk (albeit with other corruption issues maybe).
This means that you can force page checksum failures from within SQL Server – great for demos and testing. This is how I create corrupt databases.
To finish off this introductory post I’ll show you how to do just that, using a non-production server please…
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
|
CREATE DATABASE [CorruptDB]; GO USE [CorruptDB]; GO CREATE TABLE [Test] ( [c1] INT IDENTITY, [c2] CHAR (8000) DEFAULT 'a' ); GO INSERT INTO [Test] DEFAULT VALUES ; GO DBCC IND (N 'CorruptDB' , N 'Test' , -1); GO |
PageFID PagePID IAMFID IAMPID ObjectID ... ------- ----------- ------ ----------- ----------- ... 1 154 NULL NULL 101575400 ... 1 153 1 154 101575400 ... |
I’m going to corrupt the data page – page (1:153):
1
2
3
4
|
ALTER DATABASE [CorruptDB] SET SINGLE_USER; GO DBCC WRITEPAGE (N 'CorruptDB' , 1, 153, 4000, 1, 0x45, 1); GO |
Now the database is corrupt and I’ve circumvented the buffer pool so the page checksum is incorrect now – watch…
1
2
|
SELECT * FROM [Test]; GO |
c1 c2 ----------- ---------- ... Msg 824, Level 24, State 2, Line 1 SQL Server detected a logical consistency-based I/O error: incorrect checksum (expected: 0x41fb2e55; actual: 0x41fb4b55). It occurred during a read of page (1:153) in database ID 29 at offset 0x00000000132000 in file 'C:\Program Files\Microsoft SQL Server\MSSQL10.MSSQLSERVER\MSSQL\DATA\CorruptDB.mdf'. Additional messages in the SQL Server error log or system event log may provide more detail. This is a severe error condition that threatens database integrity and must be corrected immediately. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online. |
Nice eh? Yes, if that’s what you’re trying to do.
In the error log:
1
2
|
EXEC xp_readerrorlog; GO |
2013-02-05 14:32:56.760 spid55 Starting up database 'CorruptDB'. 2013-02-05 14:36:37.320 spid55 Setting database option SINGLE_USER to ON for database CorruptDB. 2013-02-05 14:36:37.320 spid55 User "APPLECROSS\paul" is modifying bytes 4000 to 4001 of page (1:153) in database "CorruptDB". 2013-02-05 14:37:55.160 spid55 Error: 824, Severity: 24, State: 2. 2013-02-05 14:37:55.160 spid55 SQL Server detected a logical consistency-based I/O error: blah blah blah |
I’m not going to discuss how the use of DBCC WRITEPAGE is tracked within the database itself.
I’m going to blog a bunch more about DBCC WRITEPAGE over the next few months. If you’re going to play with it, please do not use it on a production system, and be very, very careful. It’s very easy to do something disastrous. And remember, if you use DBCC WRITEPAGE, you do so entirely at your own risk.
PS I really struggled with whether to post this or not, but in the end, given that it has and will continue to be posted by others, I want some authoritative information out there with warnings. There are all kinds of things you can do to destroy SQL Server data using Windows and the I/O subsystem (e.g. formatting an array) – doesn’t mean you go try it on a production system though without realizing the consequences. The same principle applies here. It isn’t irresponsible to post this information in the face of it becoming public anyway – better to have the complete story IMHO.
ALTER DATABASE [YieldTest] SET SINGLE_USER WITH ROLLBACK IMMEDIATE; GO DBCC WRITEPAGE (N'YieldTest', 1, 9, 4000, 1, 0x45, 1); GO ALTER DATABASE [YieldTest] SET MULTI_USER GO SELECT * FROM [dbo].[SampleTable] 消息 5069,级别 16,状态 1,第 1 行 ALTER DATABASE 语句失败。 消息 824,级别 24,状态 2,第 1 行 SQL Server 检测到基于一致性的逻辑 I/O 错误 校验和不正确(应为: 0xb47b9a72,但实际为: 0xb47bdf72)。在文件 'D:\Program Files\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQL\DATA\YieldTest.mdf' 中、偏移量为 0x00000000012000 的位置对数据库 ID 13 中的页 (1:9) 执行 读取 期间,发生了该错误。SQL Server 错误日志或系统事件日志中的其他消息可能提供了更详细信息。这是一个威胁数据库完整性的严重错误条件,必须立即纠正。请执行完整的数据库一致性检查(DBCC CHECKDB)。此错误可以由许多因素导致;有关详细信息,请参阅 SQL Server 联机丛书。