MySQL Regular Expressions

1. Introduction

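Regular expressions describe or match strings that follow a given rule. They are used much like LIKE but are more powerful and can express far more specialized matching rules. In MySQL they are applied with the REGEXP operator, which returns 1 when the string matches and 0 when it does not. Unless the pattern is anchored, REGEXP matches anywhere in the string, so a plain pattern behaves like LIKE '%pattern%'. Prefixing it with NOT (NOT REGEXP) is the counterpart of NOT LIKE.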
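To make the 1/0 behaviour concrete outside a MySQL session, here is a minimal Python sketch. The `regexp` helper is hypothetical, and it relies on Python's `re` module, which is a different engine from MySQL's, so it only approximates REGEXP for simple patterns like these.

```python
import re

def regexp(text, pattern):
    """Hypothetical stand-in for MySQL's `text REGEXP pattern`.

    Returns 1 if the pattern matches anywhere in the text, else 0.
    Python's re engine is assumed to behave like MySQL's for these basics.
    """
    return 1 if re.search(pattern, text) else 0

print(regexp("abc", "b"))      # unanchored: found in the middle, like LIKE '%b%'
print(regexp("abc", "^b"))     # anchored at the start: "abc" does not begin with b
print(1 - regexp("abc", "x"))  # NOT REGEXP simply flips the 1/0 result
```

The helper uses `re.search` rather than `re.match` because `re.match` only looks at the start of the string, which would not model the unanchored behaviour described above.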

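Pattern   Description
^         Matches at the beginning of the string
$         Matches at the end of the string
.         Matches any character (including carriage return and newline)
[...]     Matches any single character listed inside the brackets
[m-n]     Matches any single character in the range m to n, e.g. [0-9], [a-z], [A-Z]
[^...]    Matches any single character not listed inside the brackets
a*        Matches zero or more a's, including the empty string; can serve as a placeholder
a+        Matches one or more a's, excluding the empty string
a?        Matches zero or one a
a1|a2     Matches a1 or a2
a{m}      Matches exactly m a's
a{m,}     Matches m or more a's
a{m,n}    Matches between m and n a's
a{,n}     Matches between 0 and n a's
(...)     Groups pattern elements into a single element; e.g. (do)* matches zero or more occurrences of do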

 

2. Usage Tests

^

Matches at the beginning of the string.

mysql> SELECT  'abc' REGEXP '^a';
+-------------------+
| 'abc' REGEXP '^a' |
+-------------------+
|                 1 |
+-------------------+
1 row in set (0.00 sec)

$

Matches at the end of the string.

mysql> SELECT  'abc' REGEXP 'a$';
+-------------------+
| 'abc' REGEXP 'a$' |
+-------------------+
|                 0 |
+-------------------+
1 row in set (0.01 sec)

mysql> SELECT  'abc' REGEXP 'c$';
+-------------------+
| 'abc' REGEXP 'c$' |
+-------------------+
|                 1 |
+-------------------+
1 row in set (0.00 sec)

.

Matches any single character.

mysql> SELECT  'abc' REGEXP '.a';
+-------------------+
| 'abc' REGEXP '.a' |
+-------------------+
|                 0 |
+-------------------+
1 row in set (0.00 sec)

mysql> SELECT  'abc' REGEXP '.b';
+-------------------+
| 'abc' REGEXP '.b' |
+-------------------+
|                 1 |
+-------------------+
1 row in set (0.00 sec)

mysql> SELECT  'abc' REGEXP '.c';
+-------------------+
| 'abc' REGEXP '.c' |
+-------------------+
|                 1 |
+-------------------+
1 row in set (0.00 sec)

mysql> SELECT  'abc' REGEXP 'a.';
+-------------------+
| 'abc' REGEXP 'a.' |
+-------------------+
|                 1 |
+-------------------+
1 row in set (0.00 sec)

  

[...]

Matches any single character listed inside the brackets.

mysql> SELECT  'abc' REGEXP '[xyz]';
+----------------------+
| 'abc' REGEXP '[xyz]' |
+----------------------+
|                    0 |
+----------------------+
1 row in set (0.00 sec)

mysql> SELECT  'abc' REGEXP '[xaz]';
+----------------------+
| 'abc' REGEXP '[xaz]' |
+----------------------+
|                    1 |
+----------------------+
1 row in set (0.00 sec)

 

[^...]

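Note that ^ means negation only when it is the first character inside [ ]; anywhere else it anchors the match at the beginning of the string.

Note: the string to the left of REGEXP is searched as a whole, and the result is 1 if any part of it satisfies the pattern.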

mysql> SELECT  'a' REGEXP '[^abc]';
+---------------------+
| 'a' REGEXP '[^abc]' |
+---------------------+
|                   0 |
+---------------------+
1 row in set (0.00 sec)

mysql> SELECT  'x' REGEXP '[^abc]';
+---------------------+
| 'x' REGEXP '[^abc]' |
+---------------------+
|                   1 |
+---------------------+
1 row in set (0.00 sec)


mysql> SELECT  'abc' REGEXP '[^a]';
+---------------------+
| 'abc' REGEXP '[^a]' |
+---------------------+
|                   1 |
+---------------------+
1 row in set (0.00 sec)

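Note: 'abc' is searched as a whole, so [^a] matches here. The class cannot match the a, but the string also contains b, a character other than a, and that is enough for the result to be 1.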
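The following Python sketch shows which character a negated class actually lands on. `first_match` is a hypothetical helper, and Python's `re` is assumed to treat these simple bracket expressions the same way MySQL does.

```python
import re

def first_match(text, pattern):
    """Return the first substring the pattern matches, or None (hypothetical helper)."""
    m = re.search(pattern, text)
    return m.group(0) if m else None

print(first_match("abc", "[^a]"))  # the class skips 'a' and matches 'b'
print(first_match("aaa", "[^a]"))  # every character is 'a', so nothing matches
print(first_match("abc", "^a"))    # outside brackets, ^ anchors at the start
```

To require that a string contain no a at all, the negated class has to cover the whole string, for example with a pattern such as '^[^a]*$'.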

 

a*

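Matches zero or more a's, including the empty string. It can serve as a placeholder: the pattern matches whether or not the character is present.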

mysql> SELECT 'stab' REGEXP '.ta*b';
+-----------------------+
| 'stab' REGEXP '.ta*b' |
+-----------------------+
|                     1 |
+-----------------------+
1 row in set (0.00 sec)

mysql> SELECT 'stb' REGEXP '.ta*b';
+----------------------+
| 'stb' REGEXP '.ta*b' |
+----------------------+
|                    1 |
+----------------------+

mysql> SELECT '' REGEXP 'a*';
+----------------+
| '' REGEXP 'a*' |
+----------------+
|              1 |
+----------------+
1 row in set (0.00 sec)

  

a+

Matches one or more a's; it does not match the empty string.

mysql> SELECT 'stab' REGEXP '.ta+b';
+-----------------------+
| 'stab' REGEXP '.ta+b' |
+-----------------------+
|                     1 |
+-----------------------+
1 row in set (0.00 sec)

mysql> SELECT 'stb' REGEXP '.ta+b';
+----------------------+
| 'stb' REGEXP '.ta+b' |
+----------------------+
|                    0 |
+----------------------+
1 row in set (0.00 sec)

  

a?

Matches zero or one a.

mysql> SELECT 'stb' REGEXP '.ta?b';
+----------------------+
| 'stb' REGEXP '.ta?b' |
+----------------------+
|                    1 |
+----------------------+
1 row in set (0.00 sec)

mysql> SELECT 'stab' REGEXP '.ta?b';
+-----------------------+
| 'stab' REGEXP '.ta?b' |
+-----------------------+
|                     1 |
+-----------------------+
1 row in set (0.00 sec)

mysql> SELECT 'staab' REGEXP '.ta?b';
+------------------------+
| 'staab' REGEXP '.ta?b' |
+------------------------+
|                      0 |
+------------------------+
1 row in set (0.00 sec)
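To see the three quantifiers side by side, the sketch below runs the same three strings against `*`, `+` and `?`. It uses Python's `re` as a stand-in for MySQL's engine (an assumption that holds for these basic quantifiers), and `regexp` is a hypothetical helper.

```python
import re

def regexp(text, pattern):
    """Hypothetical stand-in for MySQL's REGEXP: 1 on a match anywhere, else 0."""
    return 1 if re.search(pattern, text) else 0

texts = ["stb", "stab", "staab"]        # zero, one and two a's between t and b
patterns = [".ta*b", ".ta+b", ".ta?b"]  # zero or more, one or more, zero or one

# One row per pattern, one column per test string.
results = {p: [regexp(t, p) for t in texts] for p in patterns}
for pattern, row in results.items():
    print(pattern, row)
```

Each row differs in exactly one column: `+` rejects the string with no a, and `?` rejects the string with two.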

 

a1|a2

Matches a1 or a2.

mysql> SELECT 'a' REGEXP 'a|b';
+------------------+
| 'a' REGEXP 'a|b' |
+------------------+
|                1 |
+------------------+
1 row in set (0.00 sec)

mysql> SELECT 'b' REGEXP 'a|b';
+------------------+
| 'b' REGEXP 'a|b' |
+------------------+
|                1 |
+------------------+
1 row in set (0.00 sec)

mysql> SELECT 'b' REGEXP '^(a|b)';
+---------------------+
| 'b' REGEXP '^(a|b)' |
+---------------------+
|                   1 |
+---------------------+
1 row in set (0.00 sec)

mysql> SELECT 'a' REGEXP '^(a|b)';
+---------------------+
| 'a' REGEXP '^(a|b)' |
+---------------------+
|                   1 |
+---------------------+
1 row in set (0.00 sec)

mysql> SELECT 'c' REGEXP '^(a|b)';
+---------------------+
| 'c' REGEXP '^(a|b)' |
+---------------------+
|                   0 |
+---------------------+
1 row in set (0.00 sec)

Note: ^ means negation only inside [ ]; everywhere else, as in ^(a|b) here, it anchors the match at the beginning of the string.

 

a{m}

Matches exactly m a's.

mysql> SELECT 'auuuuc' REGEXP 'au{4}c';
+--------------------------+
| 'auuuuc' REGEXP 'au{4}c' |
+--------------------------+
|                        1 |
+--------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'auuuuc' REGEXP 'au{3}c';
+--------------------------+
| 'auuuuc' REGEXP 'au{3}c' |
+--------------------------+
|                        0 |
+--------------------------+
1 row in set (0.00 sec)

  

a{m,}

Matches m or more a's.

mysql> SELECT 'auuuuc' REGEXP 'au{3,}c';
+---------------------------+
| 'auuuuc' REGEXP 'au{3,}c' |
+---------------------------+
|                         1 |
+---------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'auuuuc' REGEXP 'au{4,}c';
+---------------------------+
| 'auuuuc' REGEXP 'au{4,}c' |
+---------------------------+
|                         1 |
+---------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'auuuuc' REGEXP 'au{5,}c';
+---------------------------+
| 'auuuuc' REGEXP 'au{5,}c' |
+---------------------------+
|                         0 |
+---------------------------+
1 row in set (0.00 sec)

  

a{m,n}

Matches between m and n a's, inclusive.

mysql> SELECT 'auuuuc' REGEXP 'au{3,5}c';
+----------------------------+
| 'auuuuc' REGEXP 'au{3,5}c' |
+----------------------------+
|                          1 |
+----------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'auuuuc' REGEXP 'au{4,5}c';
+----------------------------+
| 'auuuuc' REGEXP 'au{4,5}c' |
+----------------------------+
|                          1 |
+----------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'auuuuc' REGEXP 'au{5,10}c';
+-----------------------------+
| 'auuuuc' REGEXP 'au{5,10}c' |
+-----------------------------+
|                           0 |
+-----------------------------+
1 row in set (0.00 sec)
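The interval forms can be checked together in one pass. This is a minimal Python sketch that assumes Python's `re` handles `{m}`, `{m,}` and `{m,n}` the same way MySQL does; `regexp` is a hypothetical helper. It also shows why the examples wrap the quantified character in a and c.

```python
import re

def regexp(text, pattern):
    """Hypothetical stand-in for MySQL's REGEXP: 1 on a match anywhere, else 0."""
    return 1 if re.search(pattern, text) else 0

text = "auuuuc"  # exactly four u's between a and c

# The surrounding a and c pin down the run of u's, so the count must fit.
results = {}
for quantifier in ("{4}", "{3}", "{3,}", "{5,}", "{3,5}", "{5,10}"):
    pattern = "au" + quantifier + "c"
    results[pattern] = regexp(text, pattern)
    print(pattern, results[pattern])

# Without the surrounding characters the match is only partial:
# three of the four u's are enough to satisfy u{3}.
print("u{3}", regexp(text, "u{3}"))
```

The last line is the reason au{3}c returns 0 while a bare u{3} returns 1: a count is only "exact" when something on both sides of the run limits it.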

  

(abc)

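Matches abc as a sequence. Without parentheses, an operator applies only to a single character; to treat several characters as one unit, wrap them in parentheses. Grouping therefore combines with every operator shown above.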

mysql> SELECT 'xababy' REGEXP 'x(abab)y';
+----------------------------+
| 'xababy' REGEXP 'x(abab)y' |
+----------------------------+
|                          1 |
+----------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'xababy' REGEXP 'x(ab)*y';
+---------------------------+
| 'xababy' REGEXP 'x(ab)*y' |
+---------------------------+
|                         1 |
+---------------------------+
1 row in set (0.00 sec)

mysql> SELECT 'xababy' REGEXP 'x(ab){1,2}y';
+-------------------------------+
| 'xababy' REGEXP 'x(ab){1,2}y' |
+-------------------------------+
|                             1 |
+-------------------------------+
1 row in set (0.00 sec)
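The effect of the parentheses is easiest to see by running the same quantifier with and without them. The sketch below again uses Python's `re` as an assumed stand-in for MySQL's engine, with a hypothetical `regexp` helper.

```python
import re

def regexp(text, pattern):
    """Hypothetical stand-in for MySQL's REGEXP: 1 on a match anywhere, else 0."""
    return 1 if re.search(pattern, text) else 0

# Without parentheses, * applies only to the preceding b.
print(regexp("xabbby", "xab*y"))    # x, a, a run of b's, then y
print(regexp("xababy", "xab*y"))    # the second "a" breaks the run of b's

# With parentheses, * applies to the whole sequence ab.
print(regexp("xababy", "x(ab)*y"))  # two repetitions of ab
print(regexp("xabbby", "x(ab)*y"))  # "bb" is not a repetition of ab
```

The two patterns accept opposite strings, which is the whole difference between quantifying one character and quantifying a group.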

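Summary

Pay particular attention to the ( ) construct covered last: without parentheses, every operator applies to a single character. To match several characters as one unit, enclose them in ( ).

1. Use the REGEXP and NOT REGEXP operators (analogous to LIKE and NOT LIKE).

2. REGEXP is also case-insensitive by default; use the BINARY keyword to force a case-sensitive match: WHERE NAME REGEXP BINARY '^[A-Z]';

3. REGEXP matches partially by default: it returns true as soon as any part of the string matches. For example: SELECT  'Abc' REGEXP BINARY '[A-Z]';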
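Points 2 and 3 can be modelled together in a short Python sketch. The `binary` parameter is a hypothetical switch that imitates the BINARY keyword by turning off case-insensitive matching; it assumes the default case-insensitive behaviour stated in point 2 and does not reproduce how MySQL actually decides case sensitivity.

```python
import re

def regexp(text, pattern, binary=False):
    """Hypothetical model of REGEXP: case-insensitive unless binary=True."""
    flags = 0 if binary else re.IGNORECASE
    return 1 if re.search(pattern, text, flags) else 0

print(regexp("abc", "^[A-Z]"))                  # default: lowercase a counts as a match
print(regexp("abc", "^[A-Z]", binary=True))     # case-sensitive: lowercase a fails
print(regexp("Abc", "[A-Z]", binary=True))      # partial match: the single 'A' is enough
print(regexp("Abc", "^[A-Z]+$", binary=True))   # anchored at both ends: every character must match
```

Anchoring with ^ and $ is the usual way to turn the partial match from point 3 into a whole-string match.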

 

 

 

posted @ 2017-12-19 18:05  A.隔壁老王