在RAC上部署OGG并配置OGG高可用
在RAC上部署OGG并配置OGG高可用 孔个个 2020-09-17 原文 目录 简介 环境信息 安装OGG 配置数据库 开启数据库级别日志补充 在dbdc1为OGG单独创建TNS 创建OGG管理用户及其表空间 配置OGG 设置OGG全局参数 Source端,OGG设置, 配置管理进程 创建checkpoint表 添加跟踪对象,pos下所有表 配置extract 配置extract 指定extract的trail位置 配置extract参数 查看 查看trail目录 配置投递进程,设置本地trail 为投递进程DP_pos1 设置远程trail目标 配置DP_pos1参数文件 启动OGG 在RAC上配置OGG的高可用 查询当前RAC网络信息 在GRID中添加OGG的VIP资源 将OGGVIP资源授权给oracle用户 查看状态并启动OGGVIP 在ACFS上创建适用于grid的ogg控制脚本 使用oracle用户添加oggapp,并授权给oracle用户管理 使用GRID启动OGG failover测试 简介 由于业务系统要与大数据平台进行对接,需要将Oracle DB的数据同步到异构数据库上,故选用也不得不用上了Goldengate方案 然鹅,OGG在RAC上的HA配置一直众说纷纭,我搜索了下发现多数为single node的配置。几经探索,在这里将OGG在RAC的HA配置整理起来,为需要的伙伴提供一点思路 半部分数据已进行脱敏。 环境信息 源端数据库信息 HOSTNAME IP DB/schema dbdc1 10.0.24.131 (scan:10.0.24.135\136\137) CTCN/POS dbdc2 10.0.24.132 (scan:10.0.24.135\136\137) CTCN/POS 源端数据库版本:Oracle DB 12.1.0.1 RAC (SE) OGG版本:OGG-12.2.0.1 计划的OGG VIP:10.0.24.138 目标端:OGG for bigdata 目标端地址:10.18.0.41 安装OGG 配置ACFS,mount到/u01/ogg 配置oracle用户环境变量 [root@dbdc1 ogg]# ~oracle/.bash_profile 新增 export OGG_HOME=/u01/ogg/gg PATH=$PATH:$HOME/bin:$ORACLE_HOME/bin:$ORACLE_HOME/OPatch:$OGG_HOME export LD_LIBRARY_PATH=$ORACLE_HOME/lib 创建目录并设置权限 [root@dbdc1 ogg]# mdkir /u01/ogg/gg [root@dbdc1 ogg]# chown -R oracle:oinstall /u01/ogg 下载OGG 12.2.0.1 安装包并上传 [root@dbdc1 ogg]# ls -lh /u01/software/V100692-01.zip -rw-r--r-- 1 oracle oinstall 454M Sep 9 14:25 /u01/software/V100692-01.zip 解压 [root@dbdc1 software]# unzip V100692-01.zip -d ogg 使用oracle用户,通过X11安装 [oracle@dbdc1 Disk1]$ pwd /u01/software/ogg/fbo_ggs_Linux_x64_shiphome/Disk1 [oracle@dbdc1 Disk1]$ ./runInstaller 指定安装位置为ACFS的挂载点/u01/ogg/gg 完成安装 配置数据库 开启数据库级别日志补充 [oracle@dbdc1 ~]$ export ORACLE_SID=CTCN1 [oracle@dbdc1 ~]$ sqlplus / as sysdba SQL> ALTER DATABASE FORCE LOGGING; Database altered. SQL> ALTER DATABASE ADD SUPPLEMENTAL LOG DATA; Database altered. SQL> ALTER SYSTEM ARCHIVE LOG CURRENT; System altered. SQL> col open_mode for a10 SQL> col FORCE_LOGGING for a10 SQL> SELECT name,open_mode,force_logging,supplemental_log_data_min FROM v$database; NAME OPEN_MODE FORCE_LOGG SUPPLEMENTAL_LOG_DATA_MI --------------------------- ---------- ---------- ------------------------ CTCN READ WRITE YES YES 在dbdc1为OGG单独创建TNS 这一步并不是必要的,如果OGG只负责一个database,那么可以不配置TNS 由于我这里的案例是要为多个instance做extract,因此每个extract连接不同的database,就要用到TNS CTCN_OGG = (DESCRIPTION = (ADDRESS = (PROTOCOL = TCP)(HOST = 10.0.24.136)(PORT = 1521)) (CONNECT_DATA = (SERVER = DEDICATED) (SERVICE_NAME = CTCN) ) ) 创建OGG管理用户及其表空间 SQL> create tablespace ogg datafile '+ASM_DAT1' size 1M autoextend on maxsize 10G; Tablespace created. SQL> create user ogg default tablespace ogg identified by ogg; User created. SQL> grant connect,resource,dba to ogg; Grant succeeded. 配置OGG 设置OGG全局参数 [oracle@dbdc1 ~]$ cd /u01/ogg/gg/ [oracle@dbdc1 gg]$ ggsci GGSCI (dbdc1) 1> EDIT PARAMS ./GLOBALS GGSCHEMA ogg Source端,OGG设置, 配置管理进程 GGSCI (dbdc1) 2> EDIT PARAM MGR PORT 7809 AUTOSTART ER * AUTORESTART ER *,RETRIES 3,WAITMINUTES 5,RESETMINUTES 60 LAGREPORTHOURS 1 LAGINFOMINUTES 3 LAGCRITICALMINUTES 10 PURGEOLDEXTRACTS /u01/ogg/datdir/*,USECHECKPOINTS,MINKEEPDAYS 1 GGSCI (dbdc1) 8> DBLOGIN userid ogg@CTCN1_OGG, password ogg Successfully logged into database. 创建checkpoint表 GGSCI (dbdc1 as ogg@CTCN1) 9> add checkpointtable ogg.ggs_checkpoint Successfully created checkpoint table ogg.ggs_checkpoint. 添加跟踪对象,pos下所有表 GGSCI (dbdc1 as ogg@CTCN1) 11> ADD TRANDATA pos.* ... ... 配置extract 源端是双节点RAC,此处设置参数THREADS 2 配置extract ADD EXTRACT ext_pos1,TRANLOG,BEGIN NOW,THREADS 2 指定extract的trail位置 ADD EXTTRAIL /u01/ogg/datdir/po, EXTRACT ext_pos1 MEGABYTES 50 配置extract参数 源端是双节点RAC,此处设置参数 TRANLOGOPTIONS DBLOGREADER GGSCI (dbdc1 as ogg@CTCN1) 5> EDIT PARAMS ext_pos1 EXTRACT ext_pos1 USERID ogg@CTCN_OGG, PASSWORD ogg TRANLOGOPTIONS DBLOGREADER EXTTRAIL /u01/ogg/datdir/po TABLE pos.*; 查看 GGSCI (dbdc1) 9> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER RUNNING EXTRACT STOPPED EXT_pos1 00:00:00 00:00:36 GGSCI (dbdc1) 10> start ext ext_pos1 GGSCI (dbdc1) 11> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER RUNNING EXTRACT RUNNING EXT_pos1 00:00:00 00:00:07 查看trail目录 [oracle@dbdc1 datdir]$ pwd /u01/ogg/datdir [oracle@dbdc1 datdir]$ ll -h total 1.2M -rw-r----- 1 oracle oinstall 1.1M Sep 10 10:15 po000000000 配置投递进程,设置本地trail GGSCI (dbdc1) 1> ADD EXTRACT DP_pos1 EXTTRAILSOURCE /u01/ogg/datdir/po EXTRACT added. 为投递进程DP_pos1 设置远程trail目标 GGSCI (dbdc1) 2> ADD RMTTRAIL /data/ogg_app/dirdat/po, EXTRACT DP_pos1 RMTTRAIL added. 配置DP_pos1参数文件 GGSCI (dbdc1) 3> edit param DP_pos1 EXTRACT DP_pos1 USERID ogg@CTCN_OGG, PASSWORD ogg RMTHOST 10.18.0.41, MGRPORT 7809 RMTTRAIL /data/ogg_app/dirdat/po TABLE pos.*; 启动OGG GGSCI (dbdc1) 4> start dp_pos1 GGSCI (dbdc1) 5> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER RUNNING EXTRACT RUNNING EXT_pos1 00:00:00 00:00:03 EXTRACT RUNNING DP_pos1 00:00:00 00:00:02 检查目标端,应用trail file,配置完成。 在RAC上配置OGG的高可用 VIP:10.0.24.138 VIP_name:oggvip 查询当前RAC网络信息 [grid@dbdc1 ~]$ crsctl stat res -p |grep -ie .network -ie subnet |grep -ie name -ie subnet REGISTRATION_INVITED_SUBNETS= REGISTRATION_INVITED_SUBNETS= REGISTRATION_INVITED_SUBNETS= NAME=ora.net1.network USR_ORA_SUBNET=10.0.24.0 在GRID中添加OGG的VIP资源 [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/appvipcfg create -network=1 -ip=10.0.24.138 -vipname=oggvip -user=root Production Copyright 2007, 2008, Oracle.All rights reserved 2020-09-14 15:57:52: Creating Resource Type 2020-09-14 15:57:52: Executing /u01/grid/product/12.1.0/grid/bin/crsctl add type app.appvip_net1.type -basetype ora.cluster_vip_net1.type -file /u01/grid/product/12.1.0/grid/crs/template/appvip.type 2020-09-14 15:57:52: Executing cmd: /u01/grid/product/12.1.0/grid/bin/crsctl add type app.appvip_net1.type -basetype ora.cluster_vip_net1.type -file /u01/grid/product/12.1.0/grid/crs/template/appvip.type 2020-09-14 15:57:52: Command output: > CRS-2728: A resource type with the name 'app.appvip_net1.type' is already registered > CRS-4000: Command Add failed, or completed with errors. >End Command output CRS-2728: A resource type with the name 'app.appvip_net1.type' is already registered CRS-4000: Command Add failed, or completed with errors. 2020-09-14 15:57:52: Create the Resource 2020-09-14 15:57:52: Executing /u01/grid/product/12.1.0/grid/bin/crsctl add resource oggvip -type app.appvip_net1.type -attr "USR_ORA_VIP=10.0.24.138,START_DEPENDENCIES=hard(ora.net1.network) pullup(ora.net1.network),STOP_DEPENDENCIES=hard(ora.net1.network),ACL='owner:root:rwx,pgrp:root:r-x,other::r--,user:root:r-x',HOSTING_MEMBERS=dbdc1,APPSVIP_FAILBACK=" 2020-09-14 15:57:52: Executing cmd: /u01/grid/product/12.1.0/grid/bin/crsctl add resource oggvip -type app.appvip_net1.type -attr "USR_ORA_VIP=10.0.24.138,START_DEPENDENCIES=hard(ora.net1.network) pullup(ora.net1.network),STOP_DEPENDENCIES=hard(ora.net1.network),ACL='owner:root:rwx,pgrp:root:r-x,other::r--,user:root:r-x',HOSTING_MEMBERS=dbdc1,APPSVIP_FAILBACK=" 将OGGVIP资源授权给oracle用户 [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl setperm resource oggvip -u user:oracle:r-x 查看状态并启动OGGVIP [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl status resource oggvip NAME=oggvip TYPE=app.appvip_net1.type TARGET=OFFLINE STATE=OFFLINE [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl start resource oggvip CRS-2672: Attempting to start 'oggvip' on 'dbdc1' CRS-2676: Start of 'oggvip' on 'dbdc1' succeeded [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl status resource oggvip NAME=oggvip TYPE=app.appvip_net1.type TARGET=ONLINE STATE=ONLINE on dbdc1 [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl stat res -t -------------------------------------------------------------------------------- Cluster Resources -------------------------------------------------------------------------------- oggvip 1 ONLINE ONLINE dbdc1 STABLE 在ACFS上创建适用于grid的ogg控制脚本 mkdir /u01/ogg/gg/scripts vi ogg_act.scr #!/bin/sh #set the Oracle Goldengate installation directory export OGG_HOME=/u01/ogg/gg #set the oracle home to the database to ensure GoldenGate will get the #right environment settings to be able to connect to the database export ORACLE_HOME=/u01/app/oracle/product/12.1.0/db_1 #specify delay after start before checking for successful start start_delay_secs=5 #Include the GoldenGate home in the library path to start GGSCI export LD_LIBRARY_PATH=$ORACLE_HOME/lib:${OGG_HOME}:${LD_LIBRARY_PATH} #check_process validates that a manager process is running at the PID #that GoldenGate specifies. check_process () { if ( [ -f "${OGG_HOME}/dirpcs/MGR.pcm" ] ) then pid=`cut -f8 "${OGG_HOME}/dirpcs/MGR.pcm"` if [ ${pid} = `ps -e |grep ${pid} |grep mgr |cut -d " " -f2` ] then #manager process is running on the PID exit success exit 0 else if [ ${pid} = `ps -e |grep ${pid} |grep mgr |cut -d " " -f1` ] then #manager process is running on the PID exit success exit 0 else #manager process is not running on the PID exit 1 fi fi else #manager is not running because there is no PID file exit 1 fi } #call_ggsci is a generic routine that executes a ggsci command call_ggsci () { ggsci_command=$1 ggsci_output=`${OGG_HOME}/ggsci<<EOF ${ggsci_command} exit EOF` } case $1 in 'start') #start manager call_ggsci 'start manager' #there is a small delay between issuing the start manager command #and the process being spawned on the OS. wait before checking sleep ${start_delay_secs} #check whether manager is running and exit accordingly check_process ;; 'stop') #attempt a clean stop for all non-manager processes #call_ggsci 'stop er *' #ensure everything is stopped call_ggsci 'stop er *!' #call_ggsci 'kill er *' #stop manager without (y/n) confirmation call_ggsci 'stop manager!' #exit success exit 0 ;; 'check') check_process ;; 'clean') #attempt a clean stop for all non-manager processes #call_ggsci 'stop er *' #ensure everything is stopped #call_ggsci 'stop er *!' #in case there are lingering processes call_ggsci 'kill er *' #stop manager without (y/n) confirmation call_ggsci 'stop manager!' #exit success exit 0 ;; 'abort') #ensure everything is stopped call_ggsci 'stop er *!' #in case there are lingering processes call_ggsci 'kill er *' #stop manager without (y/n) confirmation call_ggsci 'stop manager!' #exit success exit 0 ;; esac 使用oracle用户添加oggapp,并授权给oracle用户管理 [oracle@dbdc1 scripts]$ ls /u01/ogg/gg/scripts/ogg_act.scr /u01/ogg/gg/scripts/ogg_act.scr [oracle@dbdc1 scripts]$ chmod +x /u01/ogg/gg/scripts/ogg_act.scr [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl add resource oggapp -type cluster_resource \ > -attr "ACTION_SCRIPT=/u01/ogg/gg/scripts/ogg_act.scr, \ > CHECK_INTERVAL=30, START_DEPENDENCIES='hard(oggvip,ora.asm) \ > pullup(oggvip)', STOP_DEPENDENCIES='hard(oggvip)'" [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl status resource oggapp NAME=oggapp TYPE=cluster_resource TARGET=OFFLINE STATE=OFFLINE [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl setperm resource oggapp -o oracle 使用GRID启动OGG [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl status resource oggapp NAME=oggapp TYPE=cluster_resource TARGET=OFFLINE STATE=OFFLINE [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl start resource oggapp CRS-2679: Attempting to clean 'oggapp' on 'dbdc1' CRS-2681: Clean of 'oggapp' on 'dbdc1' succeeded CRS-2672: Attempting to start 'oggapp' on 'dbdc1' CRS-2676: Start of 'oggapp' on 'dbdc1' succeeded [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl status resource oggapp NAME=oggapp TYPE=cluster_resource TARGET=ONLINE STATE=ONLINE on dbdc1 尝试一下停止 [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl stop resource oggapp CRS-2673: Attempting to stop 'oggapp' on 'dbdc1' CRS-2677: Stop of 'oggapp' on 'dbdc1' succeeded GGSCI (dbdc1) 2> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER STOPPED EXTRACT STOPPED DP_POS1 00:00:00 00:00:03 EXTRACT STOPPED EXT_POS1 00:00:02 00:00:01 尝试一下启动 [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl start resource oggapp CRS-2672: Attempting to start 'oggapp' on 'dbdc1' CRS-2676: Start of 'oggapp' on 'dbdc1' succeeded GGSCI (dbdc1) 3> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER RUNNING EXTRACT RUNNING DP_POS1 00:00:00 00:00:03 EXTRACT RUNNING EXT_POS1 00:00:01 00:00:06 grid控制成功 failover测试 注意将tnsnames配置同步到另一个节点上 [root@dbdc1 ~]# /u01/grid/product/12.1.0/grid/bin/crsctl relocate resource oggapp -f CRS-2673: Attempting to stop 'oggapp' on 'dbdc1' CRS-2677: Stop of 'oggapp' on 'dbdc1' succeeded CRS-2673: Attempting to stop 'oggvip' on 'dbdc1' CRS-2677: Stop of 'oggvip' on 'dbdc1' succeeded CRS-2672: Attempting to start 'oggvip' on 'dbdc2' CRS-2676: Start of 'oggvip' on 'dbdc2' succeeded CRS-2672: Attempting to start 'oggapp' on 'dbdc2' CRS-2676: Start of 'oggapp' on 'dbdc2' succeeded GGSCI (dbdc1) 4> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER RUNNING EXTRACT STOPPED DP_POS1 00:00:00 00:00:28 EXTRACT STOPPED EXT_POS1 00:00:00 00:00:27 GGSCI (dbdc2) 1> info all Program Status Group Lag at Chkpt Time Since Chkpt MANAGER RUNNING EXTRACT RUNNING DP_POS1 00:00:00 00:00:01