Haproxy的各种调度算法及其实现方式

官方文档:HAProxy version 2.4.15 - Configuration Manual (cbonte.github.io)

1、静态算法

静态算法：按照事先定义好的规则轮询进行调度，不关心后端服务器的当前负载、连接数和响应速度等，且无法实时动态修改权重(只能为0和1,不支持其它值)或者修改后不生效，如果需要修改只能靠重启HAProxy生效。

1.1 static-rr 算法

static-rr：基于权重的轮询调度，不支持运行时利用socat进行权重的动态调整(只支持0和1,不支持其它值)及后端服务器慢启动，其后端主机数量没有限制，相当于LVS中的wrr

范例：

[root@ubuntu2004 ~]#vim/etc/haproxy/haproxy.cfg
#在配置文件中插入以下语句

listen web_host
　　bind 10.0.0.101:80
　　balance static-rr
　　server rs1 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5
　　server rs2 10.0.0.18:80 weight 2 check inter 3000 fall 2 rise 5

#重启服务

[root@ubuntu2004 ~]#systemctl restart haproxy

#测试访问

[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28

[root@rocky8 ~]#curl 10.0.0.101

Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28

1.2 first 算法

first：根据服务器在列表中的位置，自上而下进行调度，但是其只会当第一台服务器的连接数达到上限，新请求才会分配给下一台服务，因此会忽略服务器的权重设置，此方式使用较少；不支持用socat进行动态修改权重,可以设置0和1,可以设置其它值但无效。

范例：

[root@ubuntu2004 ~]# vim /etc/haproxy/haproxy.cfg
listen web_host
    bind 10.0.0.101:80
    balance first
    server rs1 10.0.0.28:80 maxconn 2 weight 1 check inter 3000 fall 2 rise 5   #maxconn 2：当前sever的最大连接数为2                              
    server rs2 10.0.0.18:80 weight 1 check inter 3000 fall 2 rise 5
[root@haproxy ~]# systemctl reload haproxy

#测试访问
#同时运行下面命令，观察结果
root@rocky8:~# while true;do curl http://10.0.0.101/index.html ; sleep 1;done

2、动态算法

动态算法：基于后端服务器状态进行调度适当调整，新请求将优先调度至当前负载较低的服务器，且权重可以在haproxy运行时动态调整无需重启。

2.1 roundrobin

roundrobin：基于权重的轮询动态调度算法，支持权重的运行时调整，不同于lvs中的rr轮训模式，HAProxy中的roundrobin支持慢启动(新加的服务器会逐渐增加转发数)，其每个后端backend中最多支持4095个real server，支持权重动态调整，roundrobin为默认调度算法,此算法使用广泛。

[root@ubuntu2004 ~]# vim /etc/haproxy/haproxy.cfg
listen web_host
    bind 10.0.0.101:80
    balance roundrobin
    server rs1 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5
    server rs2 10.0.0.18:80 weight 2 check inter 3000 fall 2 rise 5
[root@ubuntu2004 ~]# systemctl reload haproxy

#测试访问
root@rocky8:~# curl 10.0.0.101
Web 10.0.0.28
root@rocky8:~# curl 10.0.0.101
Web 10.0.0.18
root@rocky8:~# curl 10.0.0.101
Web 10.0.0.18
root@rocky8:~# curl 10.0.0.101
Web 10.0.0.28

支持动态调整权重：

#需要提前安装socat工具
# echo "get weight web_host/rs1" | socat stdio /var/lib/haproxy/haproxy.sock  
1 (initial 1)
# echo "set weight web_host/rs1 3" | socat stdio /var/lib/haproxy/haproxy.sock #设置rs1的权重为3
# echo "get weight web_host/rs1" | socat stdio /var/lib/haproxy/haproxy.sock 
3 (initial 1)

#测试已生效

[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18

2.2 leastconn算法

leastconn加权的最少连接的动态，支持权重的运行时调整和慢启动，即:根据当前连接最少的后端服务器而非权重进行优先调度(新客户端连接)，比较适合长连接的场景使用，比如：MySQL等场景。

2.3 random算法

在1.9版本开始增加 random的负载平衡算法，其基于随机数作为一致性hash的key，随机负载平衡对于大型服务器场或经常添加或删除服务器非常有用，支持weight的动态调整，weight较大的主机有更大概率获取新请求

范例：

[root@ubuntu2004 ~]# vim /etc/haproxy/haproxy.cfg
listen web_host
    bind 10.0.0.101:80
    balance random
    server rs1 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5
    server rs2 10.0.0.18:80 weight 2 check inter 3000 fall 2 rise 5
#访问测试

[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.18
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28

3 其他算法

其它算法即可作为静态算法，又可以通过选项成为动态算法

3.1 source

源地址hash，基于用户源地址hash并将请求转发到后端服务器，后续同一个源地址请求将被转发至同一个后端web服务器。此方式当后端服务器数据量发生变化时，会导致很多用户的请求转发至新的后端服务器，默认为静态方式，但是可以通过hash-type支持的选项更改
这个算法一般是在不插入Cookie的TCP模式下使用，也可给拒绝会话cookie的客户提供最好的会话粘性，适用于session会话保持但不支持cookie和缓存的场景
源地址有两种转发客户端请求到后端服务器的服务器选取计算方式，分别是取模法和一致性hash。

3.1.1 map-base取模法

map-based：取模法，对source地址进行hash计算，再基于服务器总权重的取模，最终结果决定将此请求转发至对应的后端服务器。此方法是静态的，即不支持在线调整权重，不支持慢启动，可实现对后端服务器均衡调度。缺点是当服务器的总权重发生变化时，即有服务器上线或下线，都会因总权重发生变化而导致调度结果整体改变，hash-type 指定的默认值为此算法。

所谓取模运算，就是计算两个数相除之后的余数，10%7=3, 7%4=3 map-based算法：基于权重取模，hash(source_ip)%所有后端服务器相加的总权重。

范例：

listen web_host
 bind 10.0.0.7:80,:8801-8810,10.0.0.7:9001-9010
 mode tcp
 log global
 balance source
 hash-type map-based 
 server rs1  10.0.0.18:80 weight 1 check inter 3000 fall 2 rise 3
 server rs2  10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 3
  
#不支持动态调整权重值
[root@ubuntu2004 ~]#echo "set weight web_host/10.0.0.28 10" | socat stdio 
/var/lib/haproxy/haproxy.sock 
Backend is using a static LB algorithm and only accepts weights '0%' and '100%'.
#只能动态上线和下线
[root@ubuntu2004 ~]#echo "set weight web_host/10.0.0.28 0" | socat stdio 
/var/lib/haproxy/haproxy.sock 
[root@ubuntu2004 ~]#echo "get weight web_host/10.0.0.28" | socat stdio 
/var/lib/haproxy/haproxy.sock 
0 (initial 1）

3.1.2 一致性hash

一致性哈希，当服务器的总权重发生变化时，对调度结果影响是局部的，不会引起大的变动，hash（o）mod n ，该hash算法是动态的，支持使用 socat等工具进行在线权重调整，支持慢启动。
算法：1、key1=hash(source_ip)%(2^32) [0---4294967295]

2、keyA=hash(后端服务器虚拟ip)%(2^32)

3、将key1和keyA都放在hash环上，将用户请求调度到离key1最近的keyA对应的后端服务器

示意图：

范例：

listen web_host
    bind 10.0.0.101:80
    balance source
    hash-type consistent
    server rs1 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5
    server rs2 10.0.0.18:80 weight 1 check inter 3000 fall 2 rise 5
#测试访问

[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28
[root@rocky8 ~]#curl 10.0.0.101
Web 10.0.0.28

3.2 uri

基于对用户请求的URI的左半部分或整个uri做hash，再将hash结果对总权重进行取模后，根据最终结果将请求转发到后端指定服务器，适用于后端是缓存服务器场景，默认是静态算法，也可以通过hash-type指定map-based和consistent，来定义使用取模法还是一致性hash。
注意：此算法基于应用层，所以只支持 mode http ，不支持 mode tcp

<scheme>://<user>:<password>@<host>:<port>/<path>;<params>?<query>#<frag>
左半部分：/<path>;<params>
整个uri：/<path>;<params>?<query>#<frag>

3.2.1 uri一致性hash

配置信息

listen web_host
 bind 10.0.0.101:80,:8801-8810,10.0.0.101:9001-9010
 mode http
 log global
 balance uri
 hash-type consistent
 server web1 10.0.0.18:80 weight 1 check inter 3000 fall 2 rise 5
 server web2 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5

范例：

#在后台两台服务器准备一点文件
[root@rocky8 ~]#for i in {1..10};do echo f$i on 10.0.0.18 > /usr/share/nginx/f$i.html;done 
[root@rocky8 ~]#ls /usr/share/nginx/
f10.html  f2.html  f4.html  f6.html  f8.html  html
f1.html   f3.html  f5.html  f7.html  f9.html  modules

[root@rocky8 ~]#for i in {1..10};do echo f$i on 10.0.0.28 > /usr/share/nginx/f$i.html;done 
[root@rocky8 ~]#ls /usr/share/nginx/
f10.html  f2.html  f4.html  f6.html  f8.html  html
f1.html   f3.html  f5.html  f7.html  f9.html  modules

#访问不同的uri，确认可以将用户同样的请求转发至相同的服务器
root@ubuntu:~# curl http://10.0.0.7/f1.html
f1 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.7/f1.html
f1 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.7/f1.html
f1 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.7/f2.html
f2 on 10.0.0.28
root@ubuntu:~# curl http://10.0.0.7/f2.html
f2 on 10.0.0.28
root@ubuntu:~# curl http://10.0.0.7/f3.html
f3 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.7/f3.html
f3 on 10.0.0.18

3.3 url_param

url_param对用户请求的url中的 params 部分中的一个参数key对应的value值作hash计算，并由服务器总权重相除以后派发至某挑出的服务器；通常用于追踪用户，以确保来自同一个用户的请求始终发往同一个real server，如果无没key，将按roundrobin算法。

#假设：
url = http://www.stars.com/foo/bar/index.php?key=value
#则：
host = "www.stars.com"
url_param = "key=value"

范例：

#配置信息
listen web_host
 bind 10.0.0.101:80,:8801-8810,10.0.0.7:9001-9010
 mode http
 log global
 balance url_param userid #url_param hash
 server web1 10.0.0.18:80 weight 1 check inter 3000 fall 2 rise 5
 server web2 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5

#测试访问
root@ubuntu:~# curl http://10.0.0.101/f3.html?userid=123
f3 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.101/f2.html?userid=456
f2 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.101/f2.html?userid=abc
f2 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.101/f1.html?userid=abc
f1 on 10.0.0.18
root@ubuntu:~# curl http://10.0.0.101/index.html?userid=jkl
10.0.0.28
root@ubuntu:~# curl http://10.0.0.01/index.html?userid=jkl
10.0.0.28

3.4 hdr

针对用户每个http头部(header)请求中的指定信息做hash，此处由 name 指定的http首部将会被取出并做hash计算，然后由服务器总权重取模以后派发至某挑出的服务器，如果无有效值，则会使用默认的轮询调度。

范例：

#基于hdr一致性的hash配置
[root@ubuntu2004 ~]# vim /etc/haproxy/haproxy.cfg
listen web_host
    bind 10.0.0.101:80
    balance hdr(User-Agent)
    hash-type consistent
    server rs1 10.0.0.18:80 weight 1 check inter 3000 fall 2 rise 5
    server rs2 10.0.0.28:80 weight 1 check inter 3000 fall 2 rise 5
[root@haproxy ~]# systemctl reload haproxy.service

#测试访问
root@ubuntu:~# curl http://10.0.0.7/index.html
10.0.0.28 
root@ubuntu:~# curl http://10.0.0.7/f1.html
f1 on 10.0.0.28
root@ubuntu:~# curl http://10.0.0.7/f2.html
f2 on 10.0.0.28
root@ubuntu:~# curl http://10.0.0.7/f3.html
f3 on 10.0.0.28
root@ubuntu:~# curl -A 'IE' http://10.0.0.7/f4.html
f4 on 10.0.0.18
root@ubuntu:~# curl -A 'firefox' http://10.0.0.7/f4.html
f4 on 10.0.0.28
root@ubuntu:~# curl -A 'chrome' http://10.0.0.7/f4.html
f4 on 10.0.0.18

3.5 rdp-cookie

rdp-cookie对远windows远程桌面的负载，使用cookie保持会话，默认是静态，也可以通过hash-type 指定map-based和consistent，来定义使用取模法还是一致性hash。由于我的电脑不支持远程桌面，所以就不做示范了。

4 算法总结

#静态
static-rr--------->tcp/http  
first------------->tcp/http  

#动态
roundrobin-------->tcp/http 
leastconn--------->tcp/http 
random------------>tcp/http

#以下静态和动态取决于hash_type是否consistent
source------------>tcp/http
Uri--------------->http
url_param--------->http     
hdr--------------->http
rdp-cookie-------->tcp

#各种算法的使用场景

　first #使用较少

　static-rr #做了session共享的web集群
　roundrobin
　random

　leastconn #用于数据库
　source #基于客户端公网IP的会话保持

　uri--------------->http #缓存服务器，CDN服务商，蓝汛、百度、阿里云、腾讯
　url_param--------->http #可以实现session保持

　hdr #基于客户端请求报文头部做下一步处理

　rdp-cookie #基于Windows主机,很少使用

posted @ 2022-10-25 20:05 gy1105 阅读(298) 评论(0) 编辑收藏举报

刷新页面返回顶部

登录后才能查看或发表评论，立即登录或者逛逛博客园首页

相关博文：

· 利用Haproxy实现https访问

· redis实现sentinel哨兵架构

· 2.总结haproxy各调度算法的实现方式及其应用场景

· Linux-HAproxy调度算法

· haproxy调试算法总结

公告

昵称： gy1105
园龄： 2年6个月
粉丝： 0
关注： 2

+加关注

2025年3月

日

一

二

三

四

五

六

gy001