Centos7配置TiDB集群
一:各模块属性
模块名称 | 状态 | 建议实例数 | 功能 | 负载均衡组件 |
TiDB | 无状态 | 2 | 接收SQL请求,处理SQL相关逻辑,并通过PB找到存储数据的TiKV地址 | LVS、HAProxy、F5 |
PB | 集群 | 3 奇数个节点,推荐>3 | 整个集群的管理模块,存储元信息、对TiKV集群进行调度和负载均衡、分配全局事务ID | Raft |
TiKV | 集群 | 3 | 负责存储数据 | Raft |
二:环境要求
2.1:开发测试环境:
组件 | CPU | 内存 | 本地存储 | 网络 | 实例数量(最低要求) |
---|---|---|---|---|---|
TiDB | 8核+ | 16 GB+ | 无特殊要求 | 千兆网卡 | 1(可与 PD 同机器) |
PD | 4核+ | 8 GB+ | SAS, 200 GB+ | 千兆网卡 | 1(可与 TiDB 同机器) |
TiKV | 8核+ | 32 GB+ | SSD, 200 GB+ | 千兆网卡 | 3 |
2.2:生产环境:
组件 | CPU | 内存 | 硬盘类型 | 网络 | 实例数量(最低要求) |
---|---|---|---|---|---|
TiDB | 16核+ | 32 GB+ | SAS | 万兆网卡(2块最佳) | 2 |
PD | 4核+ | 8 GB+ | SSD | 万兆网卡(2块最佳) | 3 |
TiKV | 16核+ | 32 GB+ | SSD | 万兆网卡(2块最佳) | 3 |
监控 | 8核+ | 16 GB+ | SAS | 千兆网卡 | 1 |
2.3:端口说明:
组件 | 默认端口 | 说明 |
---|---|---|
TiDB | 4000 | 应用及 DBA 工具访问通信端口 |
TiDB | 10080 | TiDB 状态信息上报通信端口 |
TiKV | 20160 | TiKV 通信端口 |
PD | 2379 | 提供 TiDB 和 PD 通信端口 |
PD | 2380 | PD 集群节点间通信端口 |
Pump | 8250 | Pump 通信端口 |
Drainer | 8249 | Drainer 通信端口 |
Prometheus | 9090 | Prometheus 服务通信端口 |
Pushgateway | 9091 | TiDB,TiKV,PD 监控聚合和上报端口 |
Node_exporter | 9100 | TiDB 集群每个节点的系统信息上报通信端口 |
Blackbox_exporter | 9115 | Blackbox_exporter 通信端口,用于 TiDB 集群端口监控 |
Grafana | 3000 | Web 监控服务对外服务和客户端(浏览器)访问端口 |
Grafana | 8686 | grafana_collector 通信端口,用于将 Dashboard 导出为 PDF 格式 |
Kafka_exporter | 9308 | Kafka_exporter 通信端口,用于监控 binlog kafka 集群 |
三:环境部署:
3.1:群架构
xm-tidb-01 192.168.1.1
xm-pd-01 192.168.1.2
xm-tikv-01 192.168.1.3
xm-tikv-02 192.168.1.4
xm-tikv-03 192.168.1.5
3.2:创建ext4盘:(所有节点)
参照:https://www.cnblogs.com/jackyzm/p/10402275.html
vim mount-ext4.sh
#!/bin/sh #https://www.cnblogs.com/jackyzm/p/10402275.html ##缩小home空间到5G mkdir /homebak sleep 1 cp -r /home /homebak sleep 5 umount /home lvremove /dev/mapper/centos-home -y sleep 3 lvcreate -L 5G -n home centos -y sleep 3 mkfs.xfs /dev/mapper/centos-home sleep 15 mount /dev/mapper/centos-home /home ##新建ext4分区 lvcreate -L 20G -n ext4 centos -y sleep 3 mkfs.ext4 /dev/mapper/centos-ext4 sleep 15 lsblk -f
./mount-ext4.sh
vim /etc/fstab 添加
UUID=a45530a9-8b07-4b9e-b78b-d6480e239dea /ext4 ext4 defaults,nodelalloc,noatime 0 2
mkdir /ext4
mount -a
mount -t ext4
3.3:安装依赖包(在主控机操作xm-tidb-01 192.168.1.1)
yum -y install epel-release git curl sshpass
yum install -y python-pip
升级pip
pip install --upgrade pip
[root@zz-01 /]# pip -V pip 19.0.2 from /usr/lib/python2.7/site-packages/pip (python 2.7)
3.4:创建tidb用户
useradd -m -d /home/tidb tidb
passwd tidb
3.5:配置tidb用户sudo免密码
vim /etc/sudoers
添加 tidb ALL=(ALL) NOPASSWD:ALL 到末尾
用:wq!保持并推出
3.6:切换用户
su - tidb
3.6:创建tidb用户ssh key
ssh-keygen -t rsa
[tidb@xm-tidb-01 ~]$ ssh-keygen -t rsa Generating public/private rsa key pair. Enter file in which to save the key (/home/tidb/.ssh/id_rsa): Created directory '/home/tidb/.ssh'. Enter passphrase (empty for no passphrase): Enter same passphrase again: Your identification has been saved in /home/tidb/.ssh/id_rsa. Your public key has been saved in /home/tidb/.ssh/id_rsa.pub. The key fingerprint is: SHA256:7oWdyfPJIGh10V8oJR3FB2BbSFJp9W3ZX4G/8leAl/4 tidb@xm-tidb-01 The key's randomart image is: +---[RSA 2048]----+ | .o**B*.| | ++*.oB| | ..+.ooO| | ..o++o| | S . o..o| | + = o ....| | o + O o..| | . . o = . .E| | . + .| +----[SHA256]-----+
四:在中控机部署TiDB-Ansible
4.1:各版本对应关系
tidb-ansible 分支 | TiDB 版本 | 备注 |
---|---|---|
release-2.0 | 2.0 版本 | 最新 2.0 稳定版本,可用于生产环境。 |
release-2.1 | 2.1 版本 | 最新 2.1 稳定版本,可用于生产环境(建议)。 |
master | master 版本 | 包含最新特性,每日更新。 |
以tidb用户进入/home/tidb
4.2:下载对应tidb-ansible版本:(本例下载2.1版本)
release-2.0版本:git clone -b release-2.0 https://github.com/pingcap/tidb-ansible.git
release-2.1版本:git clone -b release-2.1 https://github.com/pingcap/tidb-ansible.git
master版本: git clone https://github.com/pingcap/tidb-ansible.git
4.3:安装ansible及其依赖:
cd /home/tidb/tidb-ansible/
确定pip版本为19.0.2以上pip -V
[tidb@xm-tidb-01 tidb-ansible]$ pip -V pip 19.0.2 from /usr/lib/python2.7/site-packages/pip (python 2.7)
sudo pip install -r ./requirements.txt
ansible --version
[tidb@xm-tidb-01 tidb-ansible]$ ansible --version ansible 2.6.13 config file = /home/tidb/tidb-ansible/ansible.cfg configured module search path = [u'/home/tidb/.ansible/plugins/modules', u'/usr/share/ansible/plugins/modules'] ansible python module location = /usr/lib/python2.7/site-packages/ansible executable location = /bin/ansible python version = 2.7.5 (default, Oct 30 2018, 23:45:53) [GCC 4.8.5 20150623 (Red Hat 4.8.5-36)]
五:部署集群ssh互信及sudo规则:(tidb用户在中控机上操作)
5.1:添加列表
cd /home/tidb/tidb-ansible/
vim hosts.ini
[servers] 192.168.10.221 192.168.10.222 192.168.10.223 192.168.10.224 192.168.10.225 [all:vars] username = tidb ntp_server = pool.ntp.org
5.2: 执行以下命令,按提示输入部署目标机器 root
用户密码。该步骤将在部署目标机器上创建 tidb
用户,并配置 sudo 规则,配置中控机与部署目标机器之间的 ssh 互信。
ansible-playbook -i hosts.ini create_users.yml -u root -k
[tidb@xm-tidb-01 tidb-ansible]$ ansible-playbook -i hosts.ini create_users.yml -u root -k SSH password: PLAY [all] ************************************************************************************************* TASK [create user] ***************************************************************************************** changed: [192.168.10.225] changed: [192.168.10.224] changed: [192.168.10.223] changed: [192.168.10.222] ok: [192.168.10.221] TASK [set authorized key] ********************************************************************************** changed: [192.168.10.221] changed: [192.168.10.225] changed: [192.168.10.224] changed: [192.168.10.222] changed: [192.168.10.223] TASK [update sudoers file] ********************************************************************************* changed: [192.168.10.221] changed: [192.168.10.223] changed: [192.168.10.224] changed: [192.168.10.222] changed: [192.168.10.225] PLAY RECAP ************************************************************************************************* 192.168.10.221 : ok=3 changed=2 unreachable=0 failed=0 192.168.10.222 : ok=3 changed=3 unreachable=0 failed=0 192.168.10.223 : ok=3 changed=3 unreachable=0 failed=0 192.168.10.224 : ok=3 changed=3 unreachable=0 failed=0 192.168.10.225 : ok=3 changed=3 unreachable=0 failed=0 Congrats! All goes well. :-)
六:在目标机上安装NTP服务
6.1:cd /home/tidb/tidb-ansible
6.2:ansible-playbook -i hosts.ini deploy_ntp.yml -u tidb -b
如机器未装ntp服务,脚本会自动安装
[tidb@xm-tidb-01 tidb-ansible]$ ansible-playbook -i hosts.ini deploy_ntp.yml -u tidb -b PLAY [all] ************************************************************************************************* TASK [get facts] ******************************************************************************************* ok: [192.168.10.221] ok: [192.168.10.225] ok: [192.168.10.224] ok: [192.168.10.222] ok: [192.168.10.223] TASK [RedHat family Linux distribution - make sure ntp, ntpstat have been installed] *********************** changed: [192.168.10.221] => (item=[u'ntp']) changed: [192.168.10.223] => (item=[u'ntp']) changed: [192.168.10.224] => (item=[u'ntp']) changed: [192.168.10.225] => (item=[u'ntp']) changed: [192.168.10.222] => (item=[u'ntp']) TASK [RedHat family Linux distribution - make sure ntpdate have been installed] **************************** ok: [192.168.10.221] => (item=[u'ntpdate']) ok: [192.168.10.222] => (item=[u'ntpdate']) ok: [192.168.10.223] => (item=[u'ntpdate']) ok: [192.168.10.224] => (item=[u'ntpdate']) ok: [192.168.10.225] => (item=[u'ntpdate']) TASK [Debian family Linux distribution - make sure ntp, ntpstat have been installed] *********************** TASK [Debian family Linux distribution - make sure ntpdate have been installed] **************************** TASK [RedHat family Linux distribution - make sure ntpd service has been stopped] ************************** ok: [192.168.10.221] ok: [192.168.10.225] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] TASK [Debian family Linux distribution - make sure ntp service has been stopped] *************************** TASK [Adjust Time | start to adjust time with pool.ntp.org] ************************************************ changed: [192.168.10.222] changed: [192.168.10.224] changed: [192.168.10.223] changed: [192.168.10.221] changed: [192.168.10.225] TASK [RedHat family Linux distribution - make sure ntpd service has been started] ************************** changed: [192.168.10.221] changed: [192.168.10.222] changed: [192.168.10.223] changed: [192.168.10.224] changed: [192.168.10.225] TASK [Debian family Linux distribution - Make sure ntp service has been started] *************************** PLAY RECAP ************************************************************************************************* 192.168.10.221 : ok=6 changed=3 unreachable=0 failed=0 192.168.10.222 : ok=6 changed=3 unreachable=0 failed=0 192.168.10.223 : ok=6 changed=3 unreachable=0 failed=0 192.168.10.224 : ok=6 changed=3 unreachable=0 failed=0 192.168.10.225 : ok=6 changed=3 unreachable=0 failed=0 Congrats! All goes well. :-)
七:在部署目标机配置cpufreq调节器模式:
7.1查看调节器模式
cpupower frequency-info --governors
[tidb@xm-tidb-01 tidb-ansible]$ cpupower frequency-info --governors analyzing CPU 0: available cpufreq governors: Not Available
返回 “Not Available”,表示当前系统不支持配置 CPUfreq,跳过该步骤即可。
八:分配机器资源,编辑inventory.ini文件:
8.1:vim /home/tidb/tidb-ansible/inventory.ini
## TiDB Cluster Part [tidb_servers] 192.168.10.221 [tikv_servers] 192.168.10.223 192.168.10.224 192.168.10.225 [pd_servers] 192.168.10.222 [spark_master] [spark_slaves] [lightning_server] [importer_server] ## Monitoring Part # prometheus and pushgateway servers [monitoring_servers] 192.168.10.221 [grafana_servers] 192.168.10.221 # node_exporter and blackbox_exporter servers [monitored_servers] 192.168.10.221 192.168.10.222 192.168.10.223 192.168.10.224 192.168.10.225 [alertmanager_servers] [kafka_exporter_servers] ## Binlog Part [pump_servers] [drainer_servers]
8.2:inventory.ini变量调整:
部署目录调整:
## Global variables [all:vars] #deploy_dir = /home/tidb/deploy deploy_dir = /ext4/deploy
如为某一服务单独设置部署目录,可在配置服务主机列表时配置主机变量,以 TiKV 节点为例,其他服务类推,请务必添加第一列别名,以免服务混布时混淆
TiKV1-1 ansible_host=172.16.10.4 deploy_dir=/data1/deploy
8.3:其他变量调整:True、False首字母要大写
变量 | 含义 |
---|---|
cluster_name | 集群名称,可调整 |
tidb_version | TiDB 版本,TiDB-Ansible 各分支默认已配置 |
process_supervision | 进程监管方式,默认为 systemd,可选 supervise |
timezone | 新安装 TiDB 集群第一次启动 bootstrap(初始化)时,将 TiDB 全局默认时区设置为该值。TiDB 使用的时区后续可通过 time_zone 全局变量和 session 变量来修改,参考时区支持。 默认为 Asia/Shanghai ,可选值参考 timzone 列表。 |
enable_firewalld | 开启防火墙,默认不开启,如需开启,请将部署建议-网络要求 中的端口加入白名单 |
enable_ntpd | 检测部署目标机器 NTP 服务,默认为 True,请勿关闭 |
set_hostname | 根据 IP 修改部署目标机器主机名,默认为 False |
enable_binlog | 是否部署 pump 并开启 binlog,默认为 False,依赖 Kafka 集群,参见 zookeeper_addrs 变量 |
zookeeper_addrs | binlog Kafka 集群的 zookeeper 地址 |
enable_slow_query_log | TiDB 慢查询日志记录到单独文件({{ deploy_dir }}/log/tidb_slow_query.log),默认为 False,记录到 tidb 日志 |
deploy_without_tidb | KV 模式,不部署 TiDB 服务,仅部署 PD、TiKV 及监控服务,请将 inventory.ini 文件中 tidb_servers 主机组 IP 设置为空。 |
alertmanager_target | 可选:如果你已单独部署 alertmanager,可配置该变量,格式:alertmanager_host:alertmanager_port |
grafana_admin_user | Grafana 管理员帐号用户名,默认为 admin |
grafana_admin_password | Grafana 管理员帐号密码,默认为 admin,用于 Ansible 导入 Dashboard 和创建 API Key,如后期通过 grafana web 修改了密码,请更新此变量 |
collect_log_recent_hours | 采集日志时,采集最近几个小时的日志,默认为 2 小时 |
enable_bandwidth_limit | 在中控机上从部署目标机器拉取诊断数据时,是否限速,默认为 True,与 collect_bandwidth_limit 变量结合使用 |
collect_bandwidth_limit | 在中控机上从部署目标机器拉取诊断数据时限速多少,单位: Kbit/s,默认 10000,即 10Mb/s,如果是单机多 TiKV 实例部署方式,需除以单机实例个数 |
九:部署任务:
9.1:确认inventory.ini中ansible_user = tidb
## Connection # ssh via normal user ansible_user = tidb
9.2:测试ssh互信
ansible -i inventory.ini all -m shell -a 'whoami'
[tidb@xm-tidb-01 tidb-ansible]$ ansible -i inventory.ini all -m shell -a 'whoami' 192.168.10.224 | SUCCESS | rc=0 >> tidb 192.168.10.223 | SUCCESS | rc=0 >> tidb 192.168.10.221 | SUCCESS | rc=0 >> tidb 192.168.10.222 | SUCCESS | rc=0 >> tidb 192.168.10.225 | SUCCESS | rc=0 >> tidb
9.3:测试sudo免密
ansible -i inventory.ini all -m shell -a 'whoami' -b
[tidb@xm-tidb-01 tidb-ansible]$ ansible -i inventory.ini all -m shell -a 'whoami' -b 192.168.10.224 | SUCCESS | rc=0 >> root 192.168.10.223 | SUCCESS | rc=0 >> root 192.168.10.221 | SUCCESS | rc=0 >> root 192.168.10.222 | SUCCESS | rc=0 >> root 192.168.10.225 | SUCCESS | rc=0 >> root
9.4:下载TiDB binary到中控机:
ansible-playbook local_prepare.yml
[tidb@xm-tidb-01 tidb-ansible]$ ansible-playbook local_prepare.yml PLAY [do local preparation] ******************************************************************************** TASK [local : Stop if ansible version is too low, make sure that the Ansible version is Ansible 2.4.2 or later, otherwise a compatibility issue occurs.] *** ok: [localhost] => { "changed": false, "msg": "All assertions passed" } TASK [local : create downloads and resources directories] ************************************************** changed: [localhost] => (item=/home/tidb/tidb-ansible/downloads) changed: [localhost] => (item=/home/tidb/tidb-ansible/resources) changed: [localhost] => (item=/home/tidb/tidb-ansible/resources/bin) TASK [local : create cert directory] *********************************************************************** TASK [local : create packages.yml] ************************************************************************* changed: [localhost] TASK [local : create specific deployment method packages.yml] ********************************************** changed: [localhost] TASK [local : include_vars] ******************************************************************************** ok: [localhost] TASK [local : include_vars] ******************************************************************************** ok: [localhost] TASK [local : detect outbound network] ********************************************************************* ok: [localhost] TASK [local : set outbound network fact] ******************************************************************* ok: [localhost] TASK [local : fail] **************************************************************************************** TASK [local : detect GFW] ********************************************************************************** ok: [localhost] TASK [local : set GFW fact] ******************************************************************************** ok: [localhost] TASK [local : download tidb binary] ************************************************************************ FAILED - RETRYING: download tidb binary (4 retries left). changed: [localhost] => (item={u'url': u'http://download.pingcap.org/tidb-v2.1.4-linux-amd64.tar.gz', u'version': u'v2.1.4', u'name': u'tidb'}) TASK [local : download common binary] ********************************************************************** changed: [localhost] => (item={u'url': u'http://download.pingcap.org/fio-3.8.tar.gz', u'checksum': u'sha256:15739abde7e74b59ac59df57f129b14fc5cd59e1e2eca2ce37b41f8c289c3d58', u'version': 3.8, u'name': u'fio'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/grafana_collector-latest-linux-amd64.tar.gz', u'version': u'latest', u'name': u'grafana_collector'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/kafka_exporter-1.1.0.linux-amd64.tar.gz', u'version': u'1.1.0', u'name': u'kafka_exporter'}) TASK [local : download diagnosis tools] ******************************************************************** changed: [localhost] => (item={u'url': u'http://download.pingcap.org/tidb-insight-v0.2.5-1-g99b8fea.tar.gz', u'version': u'v0.2.5-1-g99b8fea', u'name': u'tidb-insight'}) TASK [local : download cfssl binary] *********************************************************************** TASK [local : download cfssljson binary] ******************************************************************* TASK [local : include_tasks] ******************************************************************************* included: /home/tidb/tidb-ansible/roles/local/tasks/binary_deployment.yml for localhost TASK [local : download other binary] *********************************************************************** TASK [local : download other binary under gfw] ************************************************************* changed: [localhost] => (item={u'url': u'http://download.pingcap.org/prometheus-2.2.1.linux-amd64.tar.gz', u'version': u'2.2.1', u'name': u'prometheus'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/alertmanager-0.14.0.linux-amd64.tar.gz', u'version': u'0.14.0', u'name': u'alertmanager'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/node_exporter-0.15.2.linux-amd64.tar.gz', u'version': u'0.15.2', u'name': u'node_exporter'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/pushgateway-0.4.0.linux-amd64.tar.gz', u'version': u'0.4.0', u'name': u'pushgateway'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/grafana-4.6.3.linux-x64.tar.gz', u'version': u'4.6.3', u'name': u'grafana'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/blackbox_exporter-0.12.0.linux-amd64.tar.gz', u'version': u'0.12.0', u'name': u'blackbox_exporter'}) TASK [local : download TiSpark packages] ******************************************************************* changed: [localhost] => (item={u'url': u'http://download.pingcap.org/spark-2.3.2-bin-hadoop2.7.tgz', u'checksum': u'sha256:6246b20d95c7596a29fb26d5b50a3ae3163a35915bec6c515a8e183383bedc43', u'version': u'2.3.2', u'name': u'spark-2.3.2-bin-hadoop2.7.tgz'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/tispark-latest-linux-amd64.tar.gz', u'version': u'latest', u'name': u'tispark-latest.tar.gz'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/tispark-sample-data.tar.gz', u'version': u'latest', u'name': u'tispark-sample-data.tar.gz'}) TASK [local : unarchive third party binary] **************************************************************** changed: [localhost] => (item={u'url': u'https://github.com/prometheus/prometheus/releases/download/v2.2.1/prometheus-2.2.1.linux-amd64.tar.gz', u'version': u'2.2.1', u'name': u'prometheus'}) changed: [localhost] => (item={u'url': u'https://github.com/prometheus/alertmanager/releases/download/v0.14.0/alertmanager-0.14.0.linux-amd64.tar.gz', u'version': u'0.14.0', u'name': u'alertmanager'}) changed: [localhost] => (item={u'url': u'https://github.com/prometheus/node_exporter/releases/download/v0.15.2/node_exporter-0.15.2.linux-amd64.tar.gz', u'version': u'0.15.2', u'name': u'node_exporter'}) changed: [localhost] => (item={u'url': u'https://github.com/prometheus/blackbox_exporter/releases/download/v0.12.0/blackbox_exporter-0.12.0.linux-amd64.tar.gz', u'version': u'0.12.0', u'name': u'blackbox_exporter'}) changed: [localhost] => (item={u'url': u'https://github.com/prometheus/pushgateway/releases/download/v0.4.0/pushgateway-0.4.0.linux-amd64.tar.gz', u'version': u'0.4.0', u'name': u'pushgateway'}) changed: [localhost] => (item={u'url': u'https://s3-us-west-2.amazonaws.com/grafana-releases/release/grafana-4.6.3.linux-x64.tar.gz', u'version': u'4.6.3', u'name': u'grafana'}) TASK [local : unarchive tispark] *************************************************************************** changed: [localhost] TASK [local : unarchive tispark-sample-data] *************************************************************** changed: [localhost] TASK [local : cp monitoring binary] ************************************************************************ changed: [localhost] => (item=alertmanager) changed: [localhost] => (item=prometheus) changed: [localhost] => (item=node_exporter) changed: [localhost] => (item=pushgateway) changed: [localhost] => (item=blackbox_exporter) TASK [local : cp tispark] ********************************************************************************** changed: [localhost] TASK [local : cp tispark-sample-data] ********************************************************************** changed: [localhost] TASK [local : unarchive tidb binary] *********************************************************************** changed: [localhost] => (item={u'url': u'http://download.pingcap.org/tidb-v2.1.4-linux-amd64.tar.gz', u'version': u'v2.1.4', u'name': u'tidb'}) TASK [local : unarchive common binary] ********************************************************************* changed: [localhost] => (item={u'url': u'http://download.pingcap.org/fio-3.8.tar.gz', u'checksum': u'sha256:15739abde7e74b59ac59df57f129b14fc5cd59e1e2eca2ce37b41f8c289c3d58', u'version': 3.8, u'name': u'fio'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/grafana_collector-latest-linux-amd64.tar.gz', u'version': u'latest', u'name': u'grafana_collector'}) changed: [localhost] => (item={u'url': u'http://download.pingcap.org/kafka_exporter-1.1.0.linux-amd64.tar.gz', u'version': u'1.1.0', u'name': u'kafka_exporter'}) TASK [local : cp tidb binary] ****************************************************************************** changed: [localhost] => (item={u'url': u'http://download.pingcap.org/tidb-v2.1.4-linux-amd64.tar.gz', u'version': u'v2.1.4', u'name': u'tidb'}) TASK [local : cp fio binary] ******************************************************************************* changed: [localhost] => (item=fio) TASK [local : cp grafana_collector binary and fonts] ******************************************************* changed: [localhost] TASK [local : cp kafka_exporter binary] ******************************************************************** changed: [localhost] => (item=kafka_exporter) TASK [local : cp daemontools binary] *********************************************************************** TASK [local : cp tidb-insight tarball] ********************************************************************* changed: [localhost] TASK [local : clean up download dir] *********************************************************************** changed: [localhost] PLAY RECAP ************************************************************************************************* localhost : ok=30 changed=22 unreachable=0 failed=0
Congrats! All goes well. :-)
[tidb@xm-tidb-01 tidb-ansible]$
9.5:初始化系统环境
ansible-playbook bootstrap.yml
报错:内存不足
TASK [check_system_optional : Preflight check - Check TiDB server's RAM] *********************************** fatal: [192.168.10.221]: FAILED! => {"changed": false, "msg": "This machine does not have sufficient RAM to
run TiDB, at least 16000 MB."} NO MORE HOSTS LEFT ***************************************************************************************** to retry, use: --limit @/home/tidb/tidb-ansible/retry_files/bootstrap.retry PLAY RECAP ************************************************************************************************* 192.168.10.221 : ok=29 changed=10 unreachable=0 failed=1 192.168.10.222 : ok=29 changed=10 unreachable=0 failed=0 192.168.10.223 : ok=29 changed=10 unreachable=0 failed=0 192.168.10.224 : ok=29 changed=10 unreachable=0 failed=0 192.168.10.225 : ok=29 changed=10 unreachable=0 failed=0 localhost : ok=1 changed=0 unreachable=0 failed=0 ERROR MESSAGE SUMMARY ************************************************************************************** [192.168.10.221]: Ansible FAILED! => playbook: bootstrap.yml; TASK: check_system_optional : Preflight
check - Check TiDB server's RAM; message: {"changed": false, "msg": "This machine does not have sufficient
RAM to run TiDB, at least 16000 MB."} Ask for help: Contact us: support@pingcap.com It seems that you encounter some problems. You can send an email to the above email address, attached with
the tidb-ansible/inventory.ini and tidb-ansible/log/ansible.log files and the error message, or new issue
on https://github.com/pingcap/tidb-ansible/issues. We'll try our best to help you deploy a TiDB cluster.
Thanks. :-) [tidb@xm-tidb-01 tidb-ansible]$
修改文件:
vim bootstrap.yml
注销掉:
- { role: check_system_optional, when: not dev_mode|default(false) }
- { role: machine_benchmark, when: not dev_mode|default(false) }
- name: check system hosts: all any_errors_fatal: true roles: - check_system_static # - { role: check_system_optional, when: not dev_mode|default(false) } - name: tikv_servers machine benchmark hosts: tikv_servers gather_facts: false roles: # - { role: machine_benchmark, when: not dev_mode|default(false) }
再次运行:
ansible-playbook bootstrap.yml
[tidb@xm-tidb-01 tidb-ansible]$ ansible-playbook bootstrap.yml PLAY [initializing deployment target] ********************************************************************** TASK [check_config_static : Ensure only one monitoring host exists] **************************************** TASK [check_config_static : Ensure monitored_servers exists] *********************************************** TASK [check_config_static : Ensure TiDB host exists] ******************************************************* TASK [check_config_static : Ensure PD host exists] ********************************************************* TASK [check_config_static : Ensure TiKV host exists] ******************************************************* TASK [check_config_static : Check ansible_user variable] *************************************************** TASK [check_config_static : Ensure timezone variable is set] *********************************************** TASK [check_config_static : Close old SSH control master processes] **************************************** ok: [localhost] PLAY [check node config] *********************************************************************************** TASK [pre-ansible : disk space check - fail when disk is full] ********************************************* ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [pre-ansible : Get distro name from /etc/os-release] ************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [pre-ansible : set distro facts] ********************************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [pre-ansible : python check] ************************************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [pre-ansible : set has_python facts] ****************************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [pre-ansible : set has_python facts] ****************************************************************** TASK [pre-ansible : include_tasks] ************************************************************************* TASK [pre-ansible : include_tasks] ************************************************************************* included: /home/tidb/tidb-ansible/roles/pre-ansible/tasks/root_tasks.yml for 192.168.10.221, 192.168.10.222, 192.168.10.223, 192.168.10.224, 192.168.10.225 TASK [pre-ansible : Debian/Ubuntu - install python] ******************************************************** TASK [pre-ansible : Redhat/CentOS - install python] ******************************************************** TASK [pre-ansible : Redhat/CentOS - Make sure ntp, ntpstat have been installed] **************************** ok: [192.168.10.221] => (item=[u'ntp']) ok: [192.168.10.222] => (item=[u'ntp']) ok: [192.168.10.224] => (item=[u'ntp']) ok: [192.168.10.223] => (item=[u'ntp']) ok: [192.168.10.225] => (item=[u'ntp']) TASK [pre-ansible : Debian/Ubuntu - Make sure ntp, ntpstat have been installed] **************************** TASK [bootstrap : gather facts] **************************************************************************** ok: [192.168.10.225] ok: [192.168.10.222] ok: [192.168.10.221] ok: [192.168.10.223] ok: [192.168.10.224] TASK [bootstrap : group hosts by distribution] ************************************************************* ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [bootstrap : Set deploy_dir if not presented] ********************************************************* TASK [bootstrap : include_tasks] *************************************************************************** included: /home/tidb/tidb-ansible/roles/bootstrap/tasks/root_tasks.yml for 192.168.10.221, 192.168.10.222, 192.168.10.223, 192.168.10.224, 192.168.10.225 TASK [bootstrap : setting absent kernel params] ************************************************************ ok: [192.168.10.222] => (item={u'name': u'net.ipv4.tcp_tw_recycle', u'value': 0}) ok: [192.168.10.225] => (item={u'name': u'net.ipv4.tcp_tw_recycle', u'value': 0}) ok: [192.168.10.223] => (item={u'name': u'net.ipv4.tcp_tw_recycle', u'value': 0}) ok: [192.168.10.221] => (item={u'name': u'net.ipv4.tcp_tw_recycle', u'value': 0}) ok: [192.168.10.224] => (item={u'name': u'net.ipv4.tcp_tw_recycle', u'value': 0}) TASK [bootstrap : setting present kernel params] *********************************************************** ok: [192.168.10.221] => (item={u'name': u'net.core.somaxconn', u'value': 32768}) ok: [192.168.10.223] => (item={u'name': u'net.core.somaxconn', u'value': 32768}) ok: [192.168.10.224] => (item={u'name': u'net.core.somaxconn', u'value': 32768}) ok: [192.168.10.225] => (item={u'name': u'net.core.somaxconn', u'value': 32768}) ok: [192.168.10.222] => (item={u'name': u'net.core.somaxconn', u'value': 32768}) ok: [192.168.10.221] => (item={u'name': u'vm.swappiness', u'value': 0}) ok: [192.168.10.223] => (item={u'name': u'vm.swappiness', u'value': 0}) ok: [192.168.10.225] => (item={u'name': u'vm.swappiness', u'value': 0}) ok: [192.168.10.224] => (item={u'name': u'vm.swappiness', u'value': 0}) ok: [192.168.10.222] => (item={u'name': u'vm.swappiness', u'value': 0}) ok: [192.168.10.221] => (item={u'name': u'net.ipv4.tcp_syncookies', u'value': 0}) ok: [192.168.10.223] => (item={u'name': u'net.ipv4.tcp_syncookies', u'value': 0}) ok: [192.168.10.225] => (item={u'name': u'net.ipv4.tcp_syncookies', u'value': 0}) ok: [192.168.10.224] => (item={u'name': u'net.ipv4.tcp_syncookies', u'value': 0}) ok: [192.168.10.222] => (item={u'name': u'net.ipv4.tcp_syncookies', u'value': 0}) ok: [192.168.10.221] => (item={u'name': u'fs.file-max', u'value': 1000000}) ok: [192.168.10.223] => (item={u'name': u'fs.file-max', u'value': 1000000}) ok: [192.168.10.224] => (item={u'name': u'fs.file-max', u'value': 1000000}) ok: [192.168.10.222] => (item={u'name': u'fs.file-max', u'value': 1000000}) ok: [192.168.10.225] => (item={u'name': u'fs.file-max', u'value': 1000000}) TASK [bootstrap : update /etc/security/limits.conf] ******************************************************** ok: [192.168.10.221] ok: [192.168.10.224] ok: [192.168.10.223] ok: [192.168.10.225] ok: [192.168.10.222] TASK [bootstrap : disable swap] **************************************************************************** TASK [bootstrap : create group] **************************************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.224] ok: [192.168.10.225] ok: [192.168.10.223] TASK [bootstrap : create account] ************************************************************************** ok: [192.168.10.222] ok: [192.168.10.221] ok: [192.168.10.224] ok: [192.168.10.225] ok: [192.168.10.223] TASK [bootstrap : create top deploy dir when under root] *************************************************** ok: [192.168.10.221] ok: [192.168.10.224] ok: [192.168.10.223] ok: [192.168.10.222] ok: [192.168.10.225] TASK [bootstrap : create wal_dir deploy dir when under root] *********************************************** TASK [bootstrap : create raftdb_path deploy dir when under root] ******************************************* TASK [bootstrap : set hostname if hostname is not distinguishable] ***************************************** TASK [bootstrap : set hostname in hosts file] ************************************************************** TASK [bootstrap : determine if firewalld is running] ******************************************************* ok: [192.168.10.223] ok: [192.168.10.221] ok: [192.168.10.225] ok: [192.168.10.224] ok: [192.168.10.222] TASK [bootstrap : disable firewalld] *********************************************************************** TASK [bootstrap : or to enable firewalld] ****************************************************************** TASK [bootstrap : check centos configuration file exists] ************************************************** ok: [192.168.10.222] ok: [192.168.10.221] ok: [192.168.10.225] ok: [192.168.10.224] ok: [192.168.10.223] TASK [bootstrap : check debian configuration file exists] ************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [bootstrap : modify centos irqbalance configuration file] ********************************************* ok: [192.168.10.221] ok: [192.168.10.224] ok: [192.168.10.223] ok: [192.168.10.225] ok: [192.168.10.222] TASK [bootstrap : modify debian irqbalance configuration file] ********************************************* TASK [bootstrap : start irqbalance service] **************************************************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.224] ok: [192.168.10.225] ok: [192.168.10.223] PLAY [check system] **************************************************************************************** TASK [check_system_static : Disk space check - Fail task when disk is full] ******************************** ok: [192.168.10.221] ok: [192.168.10.222] ok: [192.168.10.223] ok: [192.168.10.224] ok: [192.168.10.225] TASK [check_system_static : get facts] ********************************************************************* ok: [192.168.10.221] ok: [192.168.10.225] ok: [192.168.10.224] ok: [192.168.10.222] ok: [192.168.10.223] TASK [check_system_static : Preflight check - Linux OS family and distribution version] ******************** TASK [check_system_static : Deploy check_cpufreq script] *************************************************** changed: [192.168.10.221] changed: [192.168.10.224] changed: [192.168.10.223] changed: [192.168.10.225] changed: [192.168.10.222] TASK [check_system_static : Preflight check - Check CPUfreq governors available in the kernel] ************* changed: [192.168.10.221] changed: [192.168.10.222] changed: [192.168.10.223] changed: [192.168.10.224] changed: [192.168.10.225] TASK [check_system_static : Preflight check - Check the currently active governor] ************************* changed: [192.168.10.224] changed: [192.168.10.223] changed: [192.168.10.222] changed: [192.168.10.225] changed: [192.168.10.221] TASK [check_system_static : Preflight check - Fail when CPU frequency governor is not set to performance mode] *** TASK [check_system_static : Clean check_cpufreq script] **************************************************** changed: [192.168.10.221] changed: [192.168.10.222] changed: [192.168.10.223] changed: [192.168.10.224] changed: [192.168.10.225] TASK [check_system_static : Preflight check - Check Linux kernel overcommit_memory parameter] ************** changed: [192.168.10.221] changed: [192.168.10.223] changed: [192.168.10.222] changed: [192.168.10.225] changed: [192.168.10.224] TASK [check_system_static : Preflight check - Fail when Linux kernel vm.overcommit_memory parameter is set to 2] *** PLAY [tikv_servers machine benchmark] ********************************************************************** PLAY [create ops scripts] ********************************************************************************** TASK [ops : create check_tikv.sh script] ******************************************************************* changed: [localhost] TASK [ops : create pd-ctl.sh script] *********************************************************************** changed: [localhost] PLAY RECAP ************************************************************************************************* 192.168.10.221 : ok=28 changed=5 unreachable=0 failed=0 192.168.10.222 : ok=28 changed=5 unreachable=0 failed=0 192.168.10.223 : ok=28 changed=5 unreachable=0 failed=0 192.168.10.224 : ok=28 changed=5 unreachable=0 failed=0 192.168.10.225 : ok=28 changed=5 unreachable=0 failed=0 localhost : ok=3 changed=2 unreachable=0 failed=0 Congrats! All goes well. :-)
9.6:部署TiDB集群软件:
ansible-playbook deploy.yml
过程比较多,请耐心等待
PLAY RECAP ************************************************************************************************* 192.168.10.221 : ok=115 changed=60 unreachable=0 failed=0 192.168.10.222 : ok=52 changed=23 unreachable=0 failed=0 192.168.10.223 : ok=60 changed=24 unreachable=0 failed=0 192.168.10.224 : ok=60 changed=24 unreachable=0 failed=0 192.168.10.225 : ok=63 changed=26 unreachable=0 failed=0 localhost : ok=1 changed=0 unreachable=0 failed=0 Congrats! All goes well. :-) [tidb@xm-tidb-01 tidb-ansible]$
注:Grafana Dashboard 上的 Report 按钮可用来生成 PDF 文件,此功能依赖 fontconfig
包和英文字体。如需使用该功能,登录 grafana_servers 机器,用以下命令安装:
sudo yum install fontconfig open-sans-fonts
9.7:启动集群:
!!!先切换到tidb用户!!!
su tidb
cd /home/tidb/tidb-ansible
ansible-playbook start.yml
PLAY RECAP ************************************************************************************************* 192.168.10.221 : ok=33 changed=12 unreachable=0 failed=0 192.168.10.222 : ok=12 changed=3 unreachable=0 failed=0 192.168.10.223 : ok=14 changed=3 unreachable=0 failed=0 192.168.10.224 : ok=14 changed=3 unreachable=0 failed=0 192.168.10.225 : ok=14 changed=3 unreachable=0 failed=0 localhost : ok=1 changed=0 unreachable=0 failed=0 Congrats! All goes well. :-) [tidb@xm-tidb-01 tidb-ansible]$
9.8: 测试集群:
在安装有mysql的其他服务器做连接测试
mysql -u root -h 192.168.10.221 -P 4000
-u和-h后的参数间不可有空格,否则会报错
mysql -uroot -pxxx -h 192.168.10.221 -P 4000
[root@zabbix ~]# mysql -u root -h 192.168.10.221 -P 4000 Welcome to the MariaDB monitor. Commands end with ; or \g. Your MySQL connection id is 51 Server version: 5.7.10-TiDB-v2.1.4 MySQL Community Server (Apache License 2.0) Copyright (c) 2000, 2018, Oracle, MariaDB Corporation Ab and others. Type 'help;' or '\h' for help. Type '\c' to clear the current input statement. MySQL [(none)]>
http://192.168.10.221:3000/
admin admin
感谢:
官方文档:
https://pingcap.com/docs-cn/overview/
https://blog.csdn.net/xujiamin0022016/article/details/83507038