SQLSERVER排查CPU占用高的情况
今天中午,有朋友叫我?guī)退匆幌聰?shù)據(jù)庫,操作系統(tǒng)是Windows2008R2 ,數(shù)據(jù)庫是SQL2008R2 64位
64G內(nèi)存,16核CPU
硬件配置還是比較高的,他說服務(wù)器運(yùn)行的是金蝶K3軟件,數(shù)據(jù)庫實例里有多個數(shù)據(jù)庫
現(xiàn)象
他說是這幾天才出現(xiàn)的,而且在每天的某一個時間段才會出現(xiàn)CPU占用高的情況
內(nèi)存占用不太高,只占用了30個G
CPU占用100%
排查方向
一般排查都是用下面的腳本,一般會用到三個視圖sys.sysprocesses ,dm_exec_sessions ,dm_exec_requests
1 USE master
2 GO
3 --如果要指定數(shù)據(jù)庫就把注釋去掉
4 SELECT FROM sys.[sysprocesses] WHERE [spid]>50 --AND DB_NAME([dbid])='gposdb'
5 SELECT COUNT() FROM [sys].[dm_exec_sessions] WHERE [session_id]>50
看一下當(dāng)前的數(shù)據(jù)庫用戶連接有多少
然后使用下面語句看一下各項指標(biāo)是否正常,是否有阻塞,這個語句選取了前10個最耗CPU時間的會話
復(fù)制代碼
1 SELECT TOP 10
2 [session_id],
3 [request_id],
4 [start_time] AS '開始時間',
5 [status] AS '狀態(tài)',
6 [command] AS '命令',
7 dest.[text] AS 'sql語句',
8 DB_NAME([database_id]) AS '數(shù)據(jù)庫名',
9 [blocking_session_id] AS '正在阻塞其他會話的會話ID',
10 [wait_type] AS '等待資源類型',
11 [wait_time] AS '等待時間',
12 [wait_resource] AS '等待的資源',
13 [reads] AS '物理讀次數(shù)',
14 [writes] AS '寫次數(shù)',
15 [logical_reads] AS '邏輯讀次數(shù)',
16 [row_count] AS '返回結(jié)果行數(shù)'
17 FROM sys.[dm_exec_requests] AS der
18 CROSS APPLY
19 sys.dm_exec_sql_text AS dest
20 WHERE [session_id]>50 AND DB_NAME(der.[database_id])='gposdb'
21 ORDER BY [cpu_time] DESC
復(fù)制代碼
如果想看具體的SQL語句可以執(zhí)行下面的SQL語句,記得在SSMS里選擇以文本格式顯示結(jié)果
復(fù)制代碼
1 --在SSMS里選擇以文本格式顯示結(jié)果
2 SELECT TOP 10
3 dest.[text] AS 'sql語句'
4 FROM sys.[dm_exec_requests] AS der
5 CROSS APPLY
6 sys.dm_exec_sql_text AS dest
7 WHERE [session_id]>50
8 ORDER BY [cpu_time] DESC
復(fù)制代碼
模擬了一些耗CPU時間的動作
還有查看CPU數(shù)和user scheduler數(shù)和最大工作線程數(shù),檢查worker是否用完也可以排查CPU占用情況
1 --查看CPU數(shù)和user scheduler數(shù)目
2 SELECT cpu_count,scheduler_count FROM sys.dm_os_sys_info
3 --查看最大工作線程數(shù)
4 SELECT max_workers_count FROM sys.dm_os_sys_info
查看機(jī)器上的所有schedulers包括user 和system
通過下面語句可以看到worker是否用完,當(dāng)達(dá)到最大線程數(shù)的時候就要檢查blocking了
對照下面這個表
各種CPU和SQLSERVER版本組合自動配置的最大工作線程數(shù)
CPU數(shù) 32位計算機(jī) 64位計算機(jī)
<=4 256 512
8 288 576
16 352 704
32 480 960
復(fù)制代碼
1 SELECT
2 scheduler_address,
3 scheduler_id,
4 cpu_id,
5 status,
6 current_tasks_count,
7 current_workers_count,active_workers_count
8 FROM sys.dm_os_schedulers
復(fù)制代碼
如果SQLSERVER存在要等待的資源,那么執(zhí)行下面語句就會顯示出會話中有多少個worker在等待
結(jié)合[sys].[dm_os_wait_stats]視圖,如果當(dāng)前SQLSERVER里面沒有任何等待資源,那么下面的SQL語句不會顯示任何結(jié)果
復(fù)制代碼
1 SELECT TOP 10
2 [session_id],
3 [request_id],
4 [start_time] AS '開始時間',
5 [status] AS '狀態(tài)',
6 [command] AS '命令',
7 dest.[text] AS 'sql語句',
8 DB_NAME([database_id]) AS '數(shù)據(jù)庫名',
9 [blocking_session_id] AS '正在阻塞其他會話的會話ID',
10 der.[wait_type] AS '等待資源類型',
11 [wait_time] AS '等待時間',
12 [wait_resource] AS '等待的資源',
13 [dows].[waiting_tasks_count] AS '當(dāng)前正在進(jìn)行等待的任務(wù)數(shù)',
14 [reads] AS '物理讀次數(shù)',
15 [writes] AS '寫次數(shù)',
16 [logical_reads] AS '邏輯讀次數(shù)',
17 [row_count] AS '返回結(jié)果行數(shù)'
18 FROM sys.[dm_exec_requests] AS der
19 INNER JOIN [sys].[dm_os_wait_stats] AS dows
20 ON der.[wait_type]=[dows].[wait_type]
21 CROSS APPLY
22 sys.dm_exec_sql_text AS dest
23 WHERE [session_id]>50
24 ORDER BY [cpu_time] DESC
復(fù)制代碼
比如我當(dāng)前執(zhí)行了查詢SalesOrderDetail_test表100次,由于表數(shù)據(jù)非常多,所以SSMS需要把SQLSERVER執(zhí)行的結(jié)果慢慢的取走,
造成了ASYNC_NETWORK_IO等待
1 USE [AdventureWorks]
2 GO
3 SELECT * FROM dbo.[SalesOrderDetail_test]
4 GO 100
問題源頭
經(jīng)過排查和這幾天的觀察情況,確定是某些表缺失索引導(dǎo)致,現(xiàn)在在這些表上增加了索引,問題解決
復(fù)制代碼
1 select from t_AccessControl --權(quán)限控制表權(quán)限控制
2 select from t_GroupAccess --用戶組權(quán)限表用戶組權(quán)限
3 select from t_GroupAccessType --用戶組權(quán)限類表用戶組權(quán)限類
4 select from t_ObjectAccess --對象權(quán)限表對象權(quán)限
5 select from t_ObjectAccessType --對象權(quán)限類型表對象權(quán)限類型
6 select from t_ObjectType --對象類型表對象類型
復(fù)制代碼
查詢CPU占用高的語句
復(fù)制代碼
1 SELECT TOP 10
2 total_worker_time/execution_count AS avg_cpu_cost, plan_handle,
3 execution_count,
4 (SELECT SUBSTRING(text, statement_start_offset/2 + 1,
5 (CASE WHEN statement_end_offset = -1
6 THEN LEN(CONVERT(nvarchar(max), text)) * 2
7 ELSE statement_end_offset
8 END - statement_start_offset)/2)
9 FROM sys.dm_exec_sql_text(sql_handle)) AS query_text
10 FROM sys.dm_exec_query_stats
11 ORDER BY [avg_cpu_cost] DESC
復(fù)制代碼
查詢?nèi)笔饕?/p>
復(fù)制代碼
1 SELECT
2 DatabaseName = DB_NAME(database_id)
3 ,[Number Indexes Missing] = count()
4 FROM sys.dm_db_missing_index_details
5 GROUP BY DB_NAME(database_id)
6 ORDER BY 2 DESC;
復(fù)制代碼
復(fù)制代碼
1 SELECT TOP 10
2 [Total Cost] = ROUND(avg_total_user_cost avg_user_impact * (user_seeks + user_scans),0)
3 , avg_user_impact
4 , TableName = statement
5 , [EqualityUsage] = equality_columns
6 , [InequalityUsage] = inequality_columns
7 , [Include Cloumns] = included_columns
8 FROM sys.dm_db_missing_index_groups g
9 INNER JOIN sys.dm_db_missing_index_group_stats s
10 ON s.group_handle = g.index_group_handle
11 INNER JOIN sys.dm_db_missing_index_details d
12 ON d.index_handle = g.index_handle
13 ORDER BY [Total Cost] DESC;
復(fù)制代碼
定位問題后,新建非聚集索引
復(fù)制代碼
1 CREATE NONCLUSTERED INDEX IX_t_AccessControl_F4 ON dbo.t_AccessControl
2 (
3 FObjectType
4 )include([FUserID], [FAccessType], [FAccessMask]) WITH( STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON) ON [PRIMARY]
5 GO
6
7 drop index IX_t_AccessControl_F4 on t_AccessControl
復(fù)制代碼
CPU占用恢復(fù)正常
跟蹤模板和跟蹤文件下載,請使用SQL2008R2 版本:files.cnblogs.com/lyhabc/跟蹤模板和trace.rar
總結(jié)
從多次歷史經(jīng)驗來看,如果CPU負(fù)載持續(xù)很高,但內(nèi)存和IO都還好的話,這種情況下,首先想到的一定是索引問題,十有八九錯不了。
注意文章開頭貼出的客戶機(jī)器負(fù)載情況圖
轉(zhuǎn)載于:https://blog.51cto.com/1546594/2121075
總結(jié)
以上是生活随笔為你收集整理的SQLSERVER排查CPU占用高的情况的全部內(nèi)容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: Kubernetes Dashboard
- 下一篇: 教程:一起学习Hystrix--服务(依