hp的硬件监控
hp的硬件监控
安装hp-health
注意:hp-health这个包是需要和厂商要的,因为包和服务器的型号是有一定的对应关系的,需要报给厂商拿对应机型的包
yum install -y hp-health ssacli
启动服务
systemctl status hp-health
systemctl start hp-health
systemctl enable hp-health
查看信息
- 查看内存状态信息
sudo sudo hpasmcli -s "show DIMM"
DIMM Configuration
------------------
Processor #: 1
Module #: 9
Present: Yes
Form Factor: 9h
Memory Type: DDR4(1ah)
Size: 16384 MB # 内存的大小
Speed: 2400 MHz
Supports Lock Step: No
Configured for Lock Step: No
Status: Ok # 状态值 这个也是内存的监控指标
- 查看风扇信息
sudo sudo hpasmcli -s "show FANS"
Fan Location Present Speed of max Redundant Partner Hot-pluggable
--- -------- ------- ----- ------ --------- ------- -------------
#1 SYSTEM Yes NORMAL 9% Yes 0 Yes
#2 SYSTEM Yes NORMAL 11% Yes 0 Yes
#3 SYSTEM Yes NORMAL 18% Yes 0 Yes
#4 SYSTEM Yes NORMAL 23% Yes 0 Yes
#5 SYSTEM Yes NORMAL 23% Yes 0 Yes
#6 SYSTEM Yes NORMAL 23% Yes 0 Yes
- 查看事件日志
sudo hpasmcli -s "show IML"
Event: 30 Added: 07/12/2018 13:08
INFO: POST Messages - Option ROM POST Information: 1792-Slot 0 Drive Array - Valid Data Found in Write-Back Cache. Data will automatically be written to drive array. Action: No action required..
- 查看服务器的启动方式顺序
sudo hpasmcli -s "show IPL"
IPL (Standard Boot Order)
-------------------------
#0 HDD
#1 PXE
#2 PXE
#3 CDROM
#4 USBKEY
- 电源状态信息查看
sudo hpasmcli -s "show POWERSUPPLY"
Power supply #1
Present : Yes
Redundant: Yes
Condition: Ok # 状态 监控值
Hotplug : Supported
Power : 80 Watts
Power supply #2
Present : Yes
Redundant: Yes
Condition: Ok
Hotplug : Supported
Power : 75 Watts
- 处理器信息查看
sudo hpasmcli -s "show SERVER"
System : ProLiant DL380 Gen9
Serial No. : 6CU8041MKD
ROM version : v2.52 (10/25/2017) P89
UEFI Support : Yes
iLo present : Yes
Embedded NICs : 2
NIC1 MAC: b4:96:91:20:f1:b4
NIC2 MAC: b4:96:91:20:f1:b6
Processor: 0
Name : Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
Stepping : 1
Speed : 2100 MHz
Bus : 100 MHz
Core : 8
Thread : 16
Socket : 1
Level1 Cache : 512 KBytes
Level2 Cache : 2048 KBytes
Level3 Cache : 20480 KBytes
Status : Ok
Processor: 1
Name : Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
Stepping : 1
Speed : 2100 MHz
Bus : 100 MHz
Core : 8
Thread : 16
Socket : 2
Level1 Cache : 512 KBytes
Level2 Cache : 2048 KBytes
Level3 Cache : 20480 KBytes
Status : Ok
Processor total : 2
Memory installed : 65536 MBytes
ECC supported : Yes
- 温度信息查看
sudo hpasmcli -s "show TEMP"
Temp是实时读取的值, Threshold是阈值,当temp> Threshold时报警
Sensor Location Temp Threshold
------ -------- ---- ---------
#1 AMBIENT 19C/66F 42C/107F
#2 PROCESSOR_ZONE 40C/104F 70C/158F
#3 PROCESSOR_ZONE 40C/104F 70C/158F
#4 MEMORY_BD - -
#5 MEMORY_BD 33C/91F 89C/192F
#6 MEMORY_BD - -
#7 MEMORY_BD 30C/86F 89C/192F
#8 SYSTEM_BD 35C/95F 60C/140F
#9 SYSTEM_BD - -
#10 SYSTEM_BD 43C/109F 105C/221F
#11 POWER_SUPPLY_BAY 31C/87F -
#12 POWER_SUPPLY_BAY 31C/87F -
#13 SYSTEM_BD 38C/100F 115C/239F
#14 SYSTEM_BD 38C/100F 115C/239F
#15 SYSTEM_BD 32C/89F 115C/239F
#16 SYSTEM_BD 33C/91F 115C/239F
#17 SYSTEM_BD 32C/89F 115C/239F
#18 SYSTEM_BD 31C/87F 115C/239F
#19 POWER_SUPPLY_BAY 40C/104F -
#20 POWER_SUPPLY_BAY 40C/104F -
#21 I/O_ZONE - -
#22 I/O_ZONE - -
#23 I/O_ZONE - -
#24 I/O_ZONE - -
#25 I/O_ZONE - -
#26 I/O_ZONE - -
#27 I/O_ZONE 63C/145F 100C/212F
#28 I/O_ZONE - -
#29 SYSTEM_BD - -
#30 AMBIENT 29C/84F 65C/149F
#31 I/O_ZONE 32C/89F 70C/158F
#32 I/O_ZONE 33C/91F 70C/158F
#33 I/O_ZONE 34C/93F 70C/158F
#34 I/O_ZONE - -
#35 I/O_ZONE - -
#36 I/O_ZONE - -
#37 I/O_ZONE 44C/111F 75C/167F
#38 SYSTEM_BD 36C/96F 75C/167F
#39 SYSTEM_BD 35C/95F 70C/158F
#40 SYSTEM_BD 36C/96F 75C/167F
#41 SYSTEM_BD 37C/98F 90C/194F
#42 SYSTEM_BD - -
#43 SYSTEM_BD 26C/78F 60C/140F
#44 POWER_SUPPLY_BAY 35C/95F 100C/212F
阵列信息查看
ssacli 的具体是使用直接help一下就可以
- 查看阵列信息
sudo ssacli ctrl all show config
Smart Array P840ar in Slot 0 (Embedded) (sn: PVYKH0BRHAA0ID)
Port Name: 1I
Port Name: 2I
Internal Drive Cage at Port 1I, Box 3, OK
Internal Drive Cage at Port 1I, Box 3, OK
Internal Drive Cage at Port 2I, Box 2, OK
Internal Drive Cage at Port 2I, Box 2, OK
Array A (SAS, Unused Space: 0 MB) # 阵列1 是raid1
logicaldrive 1 (279.4 GB, RAID 1, OK) # 阵列的状态
physicaldrive 1I:3:1 (port 1I:box 3:bay 1, SAS HDD, 300 GB, OK) # 阵列下边磁盘的状态
physicaldrive 1I:3:2 (port 1I:box 3:bay 2, SAS HDD, 300 GB, OK)
Array B (SAS, Unused Space: 0 MB) # 阵列2 是raid5
logicaldrive 2 (18.0 TB, RAID 5, OK) # 阵列的状态
physicaldrive 1I:3:3 (port 1I:box 3:bay 3, SAS HDD, 1.8 TB, OK) # 阵列下边磁盘的状态
physicaldrive 1I:3:4 (port 1I:box 3:bay 4, SAS HDD, 1.8 TB, OK)
physicaldrive 1I:3:5 (port 1I:box 3:bay 5, SAS HDD, 1.8 TB, OK)
physicaldrive 1I:3:6 (port 1I:box 3:bay 6, SAS HDD, 1.8 TB, OK)
physicaldrive 1I:3:7 (port 1I:box 3:bay 7, SAS HDD, 1.8 TB, OK)
physicaldrive 1I:3:8 (port 1I:box 3:bay 8, SAS HDD, 1.8 TB, OK)
physicaldrive 2I:2:1 (port 2I:box 2:bay 1, SAS HDD, 1.8 TB, OK)
physicaldrive 2I:2:2 (port 2I:box 2:bay 2, SAS HDD, 1.8 TB, OK)
physicaldrive 2I:2:3 (port 2I:box 2:bay 3, SAS HDD, 1.8 TB, OK)
physicaldrive 2I:2:4 (port 2I:box 2:bay 4, SAS HDD, 1.8 TB, OK)
physicaldrive 2I:2:5 (port 2I:box 2:bay 5, SAS HDD, 1.8 TB, OK)
physicaldrive 2I:2:6 (port 2I:box 2:bay 6, SAS HDD, 1.8 TB, OK)