[交换机在江湖] Troubleshooting Case: Unexpected Device Restart
Products and versions involved
Modular (chassis) switches, V200R006C00SPC500

Networking
Not applicable. The fault involves a single device only.

Symptom
An S9300 restarted unexpectedly at about 07:20. Running display reset-reason in any view showed that the device had not recorded a restart reason. Running display version showed an uptime of 38 minutes, which confirms that the device had in fact restarted.

<HUAWEI> display reset-reason
Info: The LPU frame[1] board[1] does not have reset records.
Info: The LPU frame[1] board[2] does not have reset records.
Info: The LPU frame[1] board[3] does not have reset records.
Info: The LPU frame[1] board[4] does not have reset records.
Info: The LPU frame[1] board[5] does not have reset records.
<HUAWEI> display version
Huawei Versatile Routing Platform Software
VRP (R) software, Version 5.160 (S9300 V200R006C00SPC500)
Copyright (C) 2000-2017 HUAWEI TECH CO., LTD
HUAWEI S9303 Terabit Routing Switch uptime is 0 week, 0 day, 0 hours, 38 minutes
... ...

Cause analysis
display reset-reason does not record every possible cause of an abnormal restart. When it shows nothing, the logs and traps must also be checked to confirm the cause.

Procedure
Check the trap buffer. At 07:17:58 both power modules (PWR1 and PWR2) reported power-on, and at 07:18:00 the device reported a cold start. This sequence indicates that the device lost power and then rebooted. The power supply environment was then investigated, and the cause was confirmed to be a fault in the external power supply.

<HUAWEI> display trapbuffer
Trapping buffer configuration and contents : enabled
Allowed max buffer size : 1024
Actual buffer size : 256
Channel number : 3 , Channel name : trapbuffer
Dropped messages : 0
Overwritten messages : 70
Current messages : 256
#May 12 2017 07:18:00 cuqiao9303 ENTMIB/4/TRAP: OID 1.3.6.1.2.1.47.2.0.1 Entity MIB change.
#May 12 2017 07:18:00 cuqiao9303 SNMP/4/COLDSTART:OID 1.3.6.1.6.3.1.1.5.1 coldStart.
#May 12 2017 07:17:58 cuqiao9303 BASETRAP/4/POWERON: OID 1.3.6.1.4.1.2011.5.25.129.2.3.2 The power supply is on.(Index=69206025, Severity=6, ProbableCause=1024, EventType=5, ContainedIn=69206021, PhysicalName="PWR2")
#May 12 2017 07:17:58 cuqiao9303 BASETRAP/4/POWERON: OID 1.3.6.1.4.1.2011.5.25.129.2.3.2 The power supply is on.(Index=68943881, Severity=6, ProbableCause=1024, EventType=5, ContainedIn=68943877, PhysicalName="PWR1")

Summary and recommendations
When a device restarts unexpectedly, do not rely on the display reset-reason output alone. Check the logs and traps as well.

Last edited by 交换机在江湖 on 2017-07-12 16:15.
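The manual check in the procedure above (a POWERON trap followed shortly by a coldStart trap) can be sketched as a small offline script run against saved display trapbuffer output. This is only an illustration: the function names, the regular expression, and the 60-second correlation window are assumptions made for this example, not part of any Huawei tool, and the trap line layout is inferred solely from the four records shown in this post.

```python
import re
from datetime import datetime, timedelta

# Assumed record layout, inferred from the sample in this post:
#   #May 12 2017 07:18:00 <hostname> <MODULE>/<severity>/<NAME>: ...
TRAP_RE = re.compile(
    r"#(?P<ts>[A-Z][a-z]{2} +\d{1,2} \d{4} \d{2}:\d{2}:\d{2}) "
    r"(?P<host>\S+) (?P<module>\w+)/(?P<sev>\d)/(?P<name>\w+):"
)


def parse_traps(text):
    """Return (timestamp, module, name) for every trap record found in text."""
    events = []
    for m in TRAP_RE.finditer(text):
        # Collapse repeated spaces (e.g. single-digit days) before parsing.
        ts = datetime.strptime(" ".join(m.group("ts").split()), "%b %d %Y %H:%M:%S")
        events.append((ts, m.group("module"), m.group("name")))
    return events


def power_loss_restarts(text, window=timedelta(seconds=60)):
    """Return cold-start times that follow a POWERON trap within `window`.

    The 60-second default is an arbitrary assumption for this sketch.
    """
    events = parse_traps(text)
    power_on = [ts for ts, mod, name in events if (mod, name) == ("BASETRAP", "POWERON")]
    hits = []
    for ts, mod, name in events:
        if (mod, name) == ("SNMP", "COLDSTART"):
            if any(timedelta(0) <= ts - p <= window for p in power_on):
                hits.append(ts)
    return hits


# Trap records copied from the display trapbuffer output in this post.
SAMPLE = """
#May 12 2017 07:18:00 cuqiao9303 ENTMIB/4/TRAP: OID 1.3.6.1.2.1.47.2.0.1 Entity MIB change.
#May 12 2017 07:18:00 cuqiao9303 SNMP/4/COLDSTART:OID 1.3.6.1.6.3.1.1.5.1 coldStart.
#May 12 2017 07:17:58 cuqiao9303 BASETRAP/4/POWERON: OID 1.3.6.1.4.1.2011.5.25.129.2.3.2 The power supply is on.(Index=69206025, Severity=6, ProbableCause=1024, EventType=5, ContainedIn=69206021, PhysicalName="PWR2")
#May 12 2017 07:17:58 cuqiao9303 BASETRAP/4/POWERON: OID 1.3.6.1.4.1.2011.5.25.129.2.3.2 The power supply is on.(Index=68943881, Severity=6, ProbableCause=1024, EventType=5, ContainedIn=68943877, PhysicalName="PWR1")
"""

if __name__ == "__main__":
    for ts in power_loss_restarts(SAMPLE):
        print(f"Cold start at {ts} shortly after power-on: likely power loss")
```

A match only suggests a power-loss restart. As in this case, the external power supply still has to be inspected to confirm the cause.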