mycat的10种分片规则

所有的分片规则配置的tableRule标签中：

rule标签中的columns标签内填写要分片的表字段，algorithm标签内填写分片所使用的自定义函数名，要与function函数中的name属性保持一致

function函数中的property标签内配置自定义参数。

1)枚举法:sharding-by-intfile

<tableRule name="sharding-by-intfile">

    <rule>

      <columns>user_id</columns>

      <algorithm>hash-int</algorithm>

    </rule>

  </tableRule>

<function name="hash-int" class="io.mycat.route.function.PartitionByFileMap">

    <property name="mapFile">partition-hash-int.txt</property>

    <property name="type">0</property>

    <property name="defaultNode">0</property>

</function>

mapFile 中是自定义的分片策略文件，需要自己编写

type表示分片字段的类型，其中1表示字符串，0表示int类型

defaultNode配置是否使用默认节点，默认为0，表示不设置默认节点，这样会导致遇到不识别的枚举值时会报错，如果设置的值大于零，该值就是默认节点，会把不识别的枚举值分配到默认节点。

这种方法适用于取值固定的场合，例如性别和省份

2)固定分片:rule 1

<tableRule name="rule1">

    <rule>

      <columns>user_id</columns>

      <algorithm>func1</algorithm>

    </rule>

</tableRule> 

  <function name="func1" class="io.mycat.route.function.PartitionByLong">

    <property name="partitionCount">2,1</property>

    <property name="partitionLength">256,512</property>

  </function>

partitionCount 表示分片个数列表，partitionLength 表示分片范围列表，两者可以都配单个值或多个值

因为分区长度默认为最大2^n=1024 ,即最大支持1024分区

所以两个值的点积恒等于1024，也就是说2*256+1*512=1024或者2*512=1024（分为两片）或者4*256=1024（分为四片）

3)范围约定:auto-sharding-long

<tableRule name="auto-sharding-long">

    <rule>

      <columns>user_id</columns>

      <algorithm>rang-long</algorithm>

    </rule>

  </tableRule>

<function name="rang-long" class="io.mycat.route.function.AutoPartitionByLong">

    <property name="mapFile">autopartition-long.txt</property>

  </function>

autopartition-long.txt文件中编写分片规则，根据指定的列的范围进行分片.默认从0节点开始

例如：0-200000=0

200000-400000=1

0-200000范围分配各节点0

200000-400000范围分配各节点1

这种方法适用于总数可知的分片场景，但是扩展比较麻烦，短时间大量顺序插入会造成单个节点压力过大

4)求模法:mod-long

<tableRule name="mod-long">

    <rule>

      <columns>user_id</columns>

      <algorithm>mod-long</algorithm>

    </rule>

  </tableRule>

  <function name="mod-long" class="io.mycat.route.function.PartitionByMod">

   <!-- how many data nodes  -->

    <property name="count">3</property>

  </function>

根据配置中的count值进行分片，将数据分成配置的count份，然后将数据均匀的分布在各个节点上，适用于单节点查询，但是查询量偏高的跨库查询会增加耗时。

5)日期列分区法:sharding-by-date

<tableRule name="sharding-by-date">

      <rule>

        <columns>create_time</columns>

        <algorithm>sharding-by-date</algorithm>

      </rule>

   </tableRule> 

<function name="sharding-by-date" class="io.mycat.route.function..PartitionByDate">

   <property name="dateFormat">yyyy-MM-dd</property>

    <property name="sBeginDate">2019-09-20</property>

    <property name="sPartionDay">5</property>

  </function>

dateFormat为字段格式，sBeginDate为开始日期，sPartionDay为分区天数。

该方法的分区方法为：从开始日期，每隔5天分一个分区；

6)通配取模:sharding-by-pattern

<tableRule name="sharding-by-pattern">

      <rule>

        <columns>user_id</columns>

        <algorithm>sharding-by-pattern</algorithm>

      </rule>

   </tableRule>

<function name="sharding-by-pattern" class="io.mycat.route.function.PartitionByPattern">

    <property name="patternValue">256</property>

    <property name="defaultNode">2</property>

    <property name="mapFile">partition-pattern.txt</property>

  </function>

patternValue是求模基数，使用int型分片字段与基数取模，根据取模结果和配置文件partition-pattern.txt决定分区，如果文件中配置1-32=1，则取模结果在范围1-32时分到1区。如果字段值不为int则分配到defaultNode配置的分区中。

7)ASCII码取模:sharding-by-prefixpattern

<tableRule name="sharding-by-prefixpattern">

      <rule>

        <columns>user_id</columns>

        <algorithm>sharding-by-prefixpattern</algorithm>

      </rule>

   </tableRule>

<function name="sharding-by-pattern" class="io.mycat.route.function.PartitionByPrefixPattern">

    <property name="patternValue">256</property>

    <property name="prefixLength">5</property>

    <property name="mapFile">partition-pattern.txt</property>

  </function>

patternValue 为求模基数，prefixLength ASCII 截取的位数

该方法截取字段值ASCII 码的指定位数（prefixLength 值）与求模基数（patternValue值）取模，然后根据配置文件partition-pattern.txt的内容分区

8)编程指定:sharding-by-substring

<tableRule name="sharding-by-substring">

      <rule>

        <columns>user_id</columns>

        <algorithm>sharding-by-substring</algorithm>

      </rule>

   </tableRule>

<function name="sharding-by-substring" class="io.mycat.route.function.PartitionDirectBySubString">

    <property name="startIndex">0</property> <!-- zero-based -->

    <property name="size">2</property>

    <property name="partitionCount">8</property>

    <property name="defaultPartition">0</property>

  </function>

该方法的指定字段必须为数字，size为截取的位数，partitionCount为分区个数，defaultPartition为默认节点。

给方法从第0个索引的数字截取字段的指定位数，截取到的数字就是分区节点编号，如果没有传值，分配到默认分区节点。例：01-55888分配到01分区

9)字符串拆分hash解析:sharding-by-stringhash

<tableRule name="sharding-by-stringhash">

      <rule>

        <columns>user_id</columns>

        <algorithm>sharding-by-stringhash</algorithm>

      </rule>

   </tableRule>

<function name="sharding-by-substring" class="io.mycat.route.function.PartitionByString">

    <property name=length>512</property> <!-- zero-based -->

    <property name="count">2</property>

    <property name="hashSlice">0:2</property>

  </function>

函数中length代表字符串hash求模基数，count是分区数，hashSlice为预算位，

该方法根据子字符串中的int值进hash运算，然后得出分区；

例如：hashSlice为0:2时截取字段值得0-2位值进行hash求模

hashSlice为-4:0时截取字段值倒数第四位到第0位值进行hash求模

10)一致性hash:sharding-by-murmur

<tableRule name="sharding-by-murmur">

      <rule>

        <columns>user_id</columns>

        <algorithm>murmur</algorithm>

      </rule>

   </tableRule>

<function name="murmur" class="io.mycat.route.function.PartitionByMurmurHash">

      <property name="seed">0</property><!-- 默认是0-->

      <property name="count">2</property><!-- 要分片的数据库节点数量，必须指定，否则没法分片—>

      <property name="virtualBucketTimes">160</property><!-- 一个实际的数据库节点被映射为这么多虚拟节点，默认是160倍，也就是虚拟节点数是物理节点数的160倍-->

      <!--

      <property name="weightMapFile">weightMapFile</property>

                     节点的权重，没有指定权重的节点默认是1。以properties文件的格式填写，以从0开始到count-1的整数值也就是节点索引为key，以节点权重值为值。所有权重值必须是正整数，否则以1代替 -->

      <!--

      <property name="bucketMapPath">/etc/mycat/bucketMapPath</property>

                      用于测试时观察各物理节点与虚拟节点的分布情况，如果指定了这个属性，会把虚拟节点的murmur hash值与物理节点的映射按行输出到这个文件，没有默认值，如果不指定，就不会输出任何东西 -->

  </function>

分类: 读写分离

分类: Mycat, MySQL

标签: mycat

好文要顶关注我收藏该文微信分享

seasonzone
粉丝 - 31 关注 - 57

+加关注

升级成为会员

« 上一篇： Mycat配置分库分表（垂直分库、水平分表）、全局序列 1
» 下一篇： MySQL的延时主从复制功能

https://www.cnblogs.com/KelvinDaniels/p/11572775.html

posted @ 2022-03-24 15:25 seasonzone 阅读(139) 评论(0) 编辑收藏举报

刷新页面返回顶部

登录后才能查看或发表评论，立即登录或者逛逛博客园首页

相关博文：

· 一篇秒懂mycat

· Mycat配置分库分表（垂直分库、水平分表）、全局序列 1

· MyCat03-常用分片规则及配置说明

· mycat分片规则详解+实例演示

· mycat分库分表的分片方案

阅读排行：
· 震惊！C++程序真的从main开始吗？99%的程序员都答错了
· 【硬核科普】Trae如何「偷看」你的代码？零基础破解AI编程运行原理
· 单元测试从入门到精通
· 上周热点回顾（3.3-3.9）
· winform 绘制太阳，地球，月球运作规律

历史上的今天：
2018-03-24 oracle 恢复中的switch datafile all是什么意思

公告

昵称： seasonzone
园龄： 12年4个月
粉丝： 31
关注： 57

+加关注

2025年3月

日

一

二

三

四

五

六

随笔分类

随笔档案

文章分类

docker 资源

docker 实用手册

mongodb 资源

oracle 网上资源

SQLSERVER 资源

http://blog.csdn.net/DBA_Huangzj/article/category/1371240

seasonzone

mycat的10种分片规则

公告

搜索

随笔分类

随笔档案

文章分类

docker 资源

mongodb 资源

oracle 网上资源

SQLSERVER 资源

网上资源 mysql

阅读排行榜

推荐排行榜

最新评论