[OGG Troubleshooting] OGG-01031

2019-05-30 17:36  那个,我
Cause
--------------------
A network failure caused the data pump (DP) processes to abend.
 
Symptoms
--------------------
All DP processes on the source are in ABENDED state and fail to restart.
GGSCI 34> info all
Program     Status      Group       Lag at Chkpt  Time Since Chkpt
MANAGER     RUNNING
EXTRACT     ABENDED     DP_XXXX1    00:00:00      00:49:33
EXTRACT     ABENDED     DP_XXXX2    00:00:00      00:49:40
EXTRACT     ABENDED     DP_XXXX3    00:00:00      00:49:32
EXTRACT     ABENDED     DP_XXXX4    00:00:00      00:49:37
EXTRACT     ABENDED     DP_XXXX5    00:00:00      00:49:34
EXTRACT     RUNNING     EX_XXXX1    00:00:01      00:00:09
EXTRACT     RUNNING     EX_XXXX2    00:00:01      00:00:09
EXTRACT     RUNNING     EX_XXXX3    00:00:01      00:00:00
EXTRACT     RUNNING     EX_XXXX4    00:00:01      00:00:10
EXTRACT     RUNNING     EX_XXXX5    00:00:01      00:00:00
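
With many groups it can help to pull the abended ones out of the `info all` output mechanically. The following is a minimal sketch: `abended_groups` is a hypothetical helper (not part of OGG) that reads `info all` output on standard input and assumes the column order shown above (Program, Status, Group).

```shell
# abended_groups: hypothetical helper, not an OGG tool.
# Reads "info all" output on stdin and prints the name of every
# group whose Status column (field 2) is ABENDED.
abended_groups() {
    awk '$2 == "ABENDED" { print $3 }'
}
```

Feeding it the listing above would print DP_XXXX1 through DP_XXXX5, one per line.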
 
Error details
--------------------
Source:
GGSCI 34> view report dp_XXXX1
2018-02-07 22:24:27 ERROR OGG-01031 There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using ENCRYPT) or an unknown error. (Reply received is Unable to open file "/ogg/dirdat/ra000128" (error 11, Resource temporarily unavailable)).
2018-02-07 22:24:27 ERROR OGG-01668 PROCESS ABENDING.
 
Target:
[ogg@target ogg]$ tail -200 ggserr.log | grep WARNING
2018-02-07 22:56:27 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to lock file "/ogg/dirdat/rb000075" (error 11, Resource temporarily unavailable). Lock currently held by process id (PID) 28854.
2018-02-07 22:56:32 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to lock file "/ogg/dirdat/rc001573" (error 11, Resource temporarily unavailable). Lock currently held by process id (PID) 28915.
2018-02-07 22:56:37 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to lock file "/ogg/dirdat/rd000476" (error 11, Resource temporarily unavailable). Lock currently held by process id (PID) 28926.
2018-02-07 22:56:42 WARNING OGG-01223 Oracle GoldenGate Collector for Oracle: Unable to lock file "/ogg/dirdat/re000005" (error 11, Resource temporarily unavailable). Lock currently held by process id (PID) 28936.
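
Because the same warning repeats every few seconds, it is easier to reduce the log to one line per lock holder. A minimal sketch, assuming the OGG-01223 message format shown above: `lock_holders` is a hypothetical helper that reads log lines on standard input and prints each distinct PID together with the trail file it holds.

```shell
# lock_holders: hypothetical helper, not an OGG tool.
# Reads ggserr.log lines on stdin and prints "PID FILE" once for each
# distinct lock holder named in an OGG-01223 warning.
lock_holders() {
    sed -n 's/.*OGG-01223.*Unable to lock file "\([^"]*\)".*(PID) \([0-9][0-9]*\).*/\2 \1/p' |
        sort -u
}
```

For example, `tail -200 ggserr.log | lock_holders` would list one line per lock holder, such as `28854 /ogg/dirdat/rb000075`.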
 
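Resolution
--------------------
Target:
The target-side warnings above name the process that holds each trail-file lock, for example "Lock currently held by process id (PID) 28854". These are most likely stale Collector processes left behind when the connection dropped. Kill each one, substituting the PID from the warning:
kill -9 <pid>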
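
SIGKILL gives a process no chance to clean up, so a more cautious variant is to check what the PID actually is, send SIGTERM first, and escalate only if the process survives. The following is a sketch under that assumption, not the procedure used in this incident: `stop_pid` is a hypothetical helper and the five-second grace period is an arbitrary choice.

```shell
# stop_pid PID: hypothetical helper, not an OGG tool.
stop_pid() {
    target_pid=$1
    # kill -0 sends no signal; it only checks that the PID exists
    # and that we are allowed to signal it.
    if ! kill -0 "$target_pid" 2>/dev/null; then
        echo "PID $target_pid: no such process or no permission"
        return 0
    fi
    # Show the command line so you can confirm what is about to be killed.
    ps -o pid= -o args= -p "$target_pid" || true
    kill -TERM "$target_pid"
    # Arbitrary grace period: poll once a second for up to 5 seconds.
    i=0
    while [ "$i" -lt 5 ]; do
        kill -0 "$target_pid" 2>/dev/null || return 0
        sleep 1
        i=$((i + 1))
    done
    # Still alive after the grace period: fall back to SIGKILL.
    kill -KILL "$target_pid"
}
```

Run `stop_pid 28854` as the OGG owner on the target, once for each PID reported in the warnings.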
 
Source:
Start the DP processes again. Once they are running, recovery is complete.
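
The restart can be scripted by generating the GGSCI commands and piping them into `ggsci`. A minimal sketch: `start_cmds` is a hypothetical helper that prints one `start extract` command per group name given, followed by `info all` to check the result. It assumes your `ggsci` accepts commands on standard input and that you run it from the OGG home directory.

```shell
# start_cmds GROUP...: hypothetical helper, not an OGG tool.
# Prints a "start extract" command for each group, then "info all"
# so the resulting status is displayed.
start_cmds() {
    for group in "$@"; do
        printf 'start extract %s\n' "$group"
    done
    printf 'info all\n'
}
```

For example, `start_cmds DP_XXXX1 DP_XXXX2 | ./ggsci` on the source. The DP groups should then show RUNNING in the `info all` output.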