KingbaseES V8R3 集群运维系列 -- 备库数据迁移单实例库

案例说明:

在生产环境需要将集群中架构转换为单实例环境,本案例以备库转换为单实例库为案例,介绍了两种方案,一种在数据库数据量小的环境下采用 sys_dumpall 导出导入方式建立单实例库;另外一种是在数据量大的情况下,采用物理copy数据文件的方式迁移到单实例库。

适用版本:

KingbaseES V8R3

案例架构:

查看集群状态:

1、集群节点状态

TEST=# show pool_nodes;                   
 node_id |   hostname    | port  | status | lb_weight |  role   | select_cnt | load_balance_node | replication_delay 
---------+---------------+-------+--------+-----------+---------+------------+-------------------+-------------------
 0       | 192.168.7.243 | 54321 | up     | 0.500000  | primary | 0          | false             | 0
 1       | 192.168.7.248 | 54321 | up     | 0.500000  | standby | 0          | true              | 0
(2 rows)

2)流复制状态

TEST=# select * from sys_stat_replication;
 PID  | USESYSID | USENAME | APPLICATION_NAME |  CLIENT_ADDR  | CLIENT_HOSTNAME | CLIENT_PORT |         BACKEND_START         | BACKEND_XMIN |   STATE   | SENT_LOCATION | WRITE_LOCATION | FLUSH_LOCATION | REPLAY_LOCATION | SYNC_PRIORITY | SYNC_STATE 
------+----------+---------+------------------+---------------+-----------------+-------------+-------------------
 3393 |       10 | SYSTEM  | node2            | 192.168.7.248 |                 |        9968 | 2021-03-01 12:09:12.662830+08 |              | streaming | 0/B0CC040     | 0/B0CC040      | 0/B0CC040      | 0/B0CC040       |      2 | sync
(1 row)

案例一: sys_dumpall导出导入方式(用于数据量小的架构,在生产环境下,不能保证单实例库的数据和集群中的数据是一致的)。

1、集群测试数据

PROD=# create table t1 (id int ,name varchar(10));
CREATE TABLE
PROD=# insert into t1 values(generate_series(1,10000),'usr'||generate_series(1,10000));
INSERT 0 10000
PROD=# select count(*) from t1;
 COUNT 
-------
 10000
(1 row)

2、在备库主机创建新的实例

# 创建单实例库数据目录
[kingbase@node1 bin]$ mkdir -p /data/kingbase/v8r3c/data

# 初始化单实例库
[kingbase@node1 bin]$ pwd
/home/kingbase/cluster/ha/db/bin

[kingbase@node1 bin]$ ./initdb -U system -W 123456 -E utf8 --case-insensitive -D /data/kingbase/v8r3c/data
.....
Success. You can now start the database server using:

    ./sys_ctl -D /data/kingbase/v8r3c/data -l logfile start

3、从备库备份数据(sys_dumpall)

[kingbase@node1 bin]$ ./sys_dumpall -U SYSTEM -W 123456 > ~/db.sql
sys_dumpall: role name starting with "sys_" skipped (SYSSAO)
sys_dumpall: role name starting with "sys_" skipped (SYSSSO)
sys_dumpall: role name starting with "sys_" skipped (SYSTEM)

4、启动单实例数据库并导入数据

# 启动数据库服务
[kingbase@node1 bin]$ ./sys_ctl start -D /data/kingbase/v8r3c/data/
server starting
.......

[kingbase@node1 bin]$ netstat -an |grep 54323
tcp        0      0 0.0.0.0:54323           0.0.0.0:*               LISTEN     
tcp6       0      0 :::54323                :::*                    LISTEN     
unix  2      [ ACC ]     STREAM     LISTENING     42092    /tmp/.s.KINGBASE.54323


# 连接数据库并导入数据
 
# 导入备库备份数据
test=# \i /home/kingbase/db.sql
SET
SET
SET
CREATE DATABASE
ksql:/home/kingbase/db.sql:25: ERROR:  database "SECURITY" already exists
REVOKE
GRANT
You are now connected to database "PROD" as user "system".
SET
SET
SET
SET
SET
SET
   set_config    
-----------------
 "$USER", PUBLIC
(1 row)
.......

5、查看单实例库数据

TEST=# \c PROD
You are now connected to database "PROD" as user "system".
PROD=# \d
                    List of relations
 Schema |             Name              | Type  | Owner  
--------+-------------------------------+-------+--------
 PUBLIC | pathman_cache_stats           | view  | system
 PUBLIC | pathman_concurrent_part_tasks | view  | system
 PUBLIC | pathman_config                | table | system
 PUBLIC | pathman_config_params         | table | system
 PUBLIC | pathman_partition_list        | view  | system
 PUBLIC | T1                            | table | system
(6 rows)

PROD=# select count(*) from t1;
 count 
-------
 10000
(1 row)

=== 从以上可以获知,集群中的数据已经导入到单实例库中,此时可以停止备库或删除集群===

案例2:物理拷贝方式(如果单实例库数据需要和集群数据一致,需要停止整个集群;如果不需要一致,只需要停止备库数据库服务即可)。

1、集群测试数据

PROD=# \d
                    List of relations
 Schema |             Name              | Type  | Owner  
--------+-------------------------------+-------+--------
 PUBLIC | PATHMAN_CACHE_STATS           | view  | SYSTEM
 PUBLIC | PATHMAN_CONCURRENT_PART_TASKS | view  | SYSTEM
 PUBLIC | PATHMAN_CONFIG                | table | SYSTEM
 PUBLIC | PATHMAN_CONFIG_PARAMS         | table | SYSTEM
 PUBLIC | PATHMAN_PARTITION_LIST        | view  | SYSTEM
 PUBLIC | T1                            | table | SYSTEM
 PUBLIC | T2                            | table | SYSTEM
 PUBLIC | T3                            | table | SYSTEM
(8 rows)

PROD=# select count(*) from t1;
 COUNT 
-------
 10000
(1 row)
.......

2、备库生成检查点

PROD=# checkpoint;
CHECKPOINT

3、停止备库cron任务和数据库服务

[root@node1 ~]# cat /etc/cron.d/KINGBASECRON 
.......
#*/1 * * * * root  /home/kingbase/cluster/ha/db/bin/network_rewind.sh
#*/1 * * * * root  /home/kingbase/cluster/ha/kingbasecluster/bin/restartcluster.sh

# 停止备库数据库服务
[kingbase@node1 bin]$ ./sys_ctl stop -D ../data
waiting for server to shut down.... done
server stopped

4、建立单实例库

# 创建单实例库数据目录
[kingbase@node1 bin]$ mkdir -p /data/kingbase/v8r3c/data

# 初始化单实例库
Tips:
   单实例库的数据库模式、大小写支持、字符集等要和原集群库一致。

[kingbase@node1 bin]$ ./initdb -U system -W 123456 -E utf8 --case-insensitive -D /data/kingbase/v8r3c/data
The files belonging to this database system will be owned by user "kingbase".
This user must also own the server process.
......

5、拷贝备库数据到单实例库

[kingbase@node1 v8r3c]$ mv data data.bk
[kingbase@node1 v8r3c]$ mkdir data

# 拷贝备库数据到单实例库
[kingbase@node1 data]$ cp -var * /data/kingbase/v8r3c/data/
‘backup_label.old’ -> ‘/data/kingbase/v8r3c/data/backup_label.old’
‘base/1/1255’ -> ‘/data/kingbase/v8r3c/data/base/1/1255’
‘base/1/1255_fsm’ -> ‘/data/kingbase/v8r3c/data/base/1/1255_fsm’
‘base/1/1247’ -> ‘/data/kingbase/v8r3c/data/base/1/1247’
‘base/1/1247_fsm’ -> ‘/data/kingbase/v8r3c/data/base/1/1247_fsm’
.......

6、修改data下的recovery.conf文件
[kingbase@node1 data]$ mv recovery.conf recovery.conf.bk

7、启动单实例库数据库服务

[kingbase@node1 bin]$ ./sys_ctl start -D /data/kingbase/v8r3c/data
server starting
[kingbase@node1 bin]$ LOG:  redirecting log output to logging collector process
HINT:  Future log output will appear in directory "/home/kingbase/cluster/ha/db/data/sys_log".

[kingbase@node1 bin]$ ps -ef |grep kingbase

kingbase 21482     1  1 14:08 pts/1    00:00:00 /home/kingbase/cluster/ha/db/bin/kingbase -D /data/kingbase/v8r3c/data
kingbase 21484 21482  0 14:08 ?        00:00:00 kingbase: logger process   
kingbase 21487 21482  0 14:08 ?        00:00:00 kingbase: checkpointer process   
kingbase 21488 21482  0 14:08 ?        00:00:00 kingbase: writer process   
kingbase 21489 21482  0 14:08 ?        00:00:00 kingbase: wal writer process   
kingbase 21490 21482  0 14:08 ?        00:00:00 kingbase: autovacuum launcher process   
kingbase 21491 21482  0 14:08 ?        00:00:00 kingbase: archiver process   
kingbase 21492 21482  0 14:08 ?        00:00:00 kingbase: stats collector process   
kingbase 21493 21482  0 14:08 ?        00:00:00 kingbase: bgworker: syslogical supervisor   
......

8、验证和查看单实例库数据

PROD=# \d
                    List of relations
 Schema |             Name              | Type  | Owner  
--------+-------------------------------+-------+--------
 PUBLIC | PATHMAN_CACHE_STATS           | view  | SYSTEM
 PUBLIC | PATHMAN_CONCURRENT_PART_TASKS | view  | SYSTEM
 PUBLIC | PATHMAN_CONFIG                | table | SYSTEM
 PUBLIC | PATHMAN_CONFIG_PARAMS         | table | SYSTEM
 PUBLIC | PATHMAN_PARTITION_LIST        | view  | SYSTEM
 PUBLIC | T1                            | table | SYSTEM
 PUBLIC | T2                            | table | SYSTEM
 PUBLIC | T3                            | table | SYSTEM
(8 rows)

PROD=# select count(*) from t1;
 COUNT 
-------
 10000
(1 row)

=== 从以上获知,备库的数据已经同步到单实例库===

总结:

   1、本案例是在原集群架构外,另外创建单实例,完成从集群到单实例库的转换,也可以直接在原集群上转换,不再赘述。
   2、转换为单实例库后,可以按照生产要求 ,删除或保留原集群架构。
   3、另外集群主库转换为单实例库,方法基本和此案例一致。
posted @ 2022-01-28 09:03  KINGBASE研究院  阅读(183)  评论(0编辑  收藏  举报