Python: the re module

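Contents

1. Introduction
2. Constants
3. Methods

3.1. Compiling [compile]
3.2. Single match [match, search, fullmatch]

3.2.1. Syntax
3.2.2. Examples

3.3. Searching the whole text [findall, finditer]

3.3.1. Syntax
3.3.2. Examples

3.4. Match and replace [sub, subn]

3.4.1. Syntax
3.4.2. Examples

3.5. Groups [group]

3.5.1. Syntax
3.5.2. Examples

3.6. Splitting strings

3.6.1. Syntax
3.6.2. Examples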


1. Introduction

Python provides regular expression processing through the re module.


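2. Constants

Constant                 Description
re.M, re.MULTILINE       Multiline mode: ^ and $ also match at the start and end of each line
re.S, re.DOTALL          "Single-line" mode: . also matches a newline
re.I, re.IGNORECASE      Ignore case
re.X, re.VERBOSE         Ignore whitespace in the expression (and allow # comments)

Use the bitwise OR operator | to turn on several options at once.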
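
As a minimal sketch of combining flags with |, the following uses made-up sample text and variable names that are illustration only. It applies re.M and re.I together, then adds re.X so the pattern can be written with whitespace and comments.

```python
import re

text = "Bottle\nbag\nBIG\napple"

# re.M lets ^ match at the start of every line; re.I ignores case.
words = re.findall(r"^b\w+", text, re.M | re.I)
print(words)  # ['Bottle', 'bag', 'BIG']

# re.X ignores whitespace in the pattern and allows comments.
pattern = re.compile(
    r"""
    ^b      # a line that starts with b
    \w+     # followed by word characters
    """,
    re.X | re.M | re.I,
)
verbose_words = pattern.findall(text)
print(verbose_words)  # ['Bottle', 'bag', 'BIG']
```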


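3. Methods

3.1. Compiling [compile]

re.compile(pattern, flags=0)
Compiles the pattern with the given flags and returns a regular expression object, regex.
pattern is the regular expression string and flags are the options. A regular expression has to be compiled before it can be used. To improve efficiency the compiled result is cached, so the same pattern does not need to be compiled again the next time it is used.
The other functions in re also compile the pattern internally and use that cache, for the same reason: speed.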
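
A small sketch of compiling once and reusing the regex object, next to the equivalent module-level call. The list `lines` and the other variable names are assumptions for illustration.

```python
import re

# Compile once, reuse many times.
regex = re.compile(r"b\w+", re.I)

lines = ["bottle", "Bag", "apple"]
matched = [line for line in lines if regex.match(line)]
print(matched)  # ['bottle', 'Bag']

# The module-level function gives the same result; it compiles
# (and caches) the pattern internally.
same = [line for line in lines if re.match(r"b\w+", line, re.I)]
print(same == matched)  # True

# The compiled object remembers the pattern it was built from.
print(regex.pattern)  # b\w+
```

Compiling explicitly is mostly a readability choice when a pattern is used in several places, and it also gives access to the pos and endpos arguments of the regex methods.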

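3.2. Single match [match, search, fullmatch]

3.2.1. Syntax

re.match(pattern, string, flags=0)
regex.match(string[, pos[, endpos]])
match only matches at the beginning of the string. The match method of a regex object can set the start and end positions. Returns a match object, or None if there is no match.

re.search(pattern, string, flags=0)
regex.search(string[, pos[, endpos]])
Scans from the beginning until the first match. The search method of a regex object can set the start and end positions. Returns a match object, or None if there is no match.

re.fullmatch(pattern, string, flags=0)
regex.fullmatch(string[, pos[, endpos]])
The whole string (or the whole [pos, endpos) range) must match the regular expression.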

3.2.2. Examples

import re

s = '''bottle\nbag\nbig\napple'''
# print every character with its index, ten pairs per line
for i,c in enumerate(s, 1):
    print((i-1, c), end='\n' if i%10==0 else ' ')
print()

# match
print('--match--')
result = re.match('b', s) # stops after the first match
print(1, result) # bottle
result = re.match('a', s) # not found, returns None
print(2, result)
result = re.match('^a', s, re.M) # still matches from the start only, multiline mode does not help
print(3, result)
result = re.match('^a', s, re.S) # still matches from the start only
print(4, result)
# compile first, then use the regex object
regex = re.compile('a')
result = regex.match(s) # still matches from the start only
print(5, result)
result = regex.match(s, 15) # start matching at index 15
print(6, result) # apple
print()
# search
print('--search--')
result = re.search('a', s) # scans for the first position that matches
print(7, result) # the a in bag, index 8
regex = re.compile('b')
result = regex.search(s, 1)
print(8, result) # bag
regex = re.compile('^b', re.M)
result = regex.search(s) # multiline or not, returns as soon as it finds a match
print(8.5, result) # bottle
result = regex.search(s, 8)
print(9, result) # big

# fullmatch
result = re.fullmatch('bag', s)
print(10, result)
regex = re.compile('bag')
result = regex.fullmatch(s)
print(11, result)
result = regex.fullmatch(s, 7)
print(12, result)
result = regex.fullmatch(s, 7, 10)
print(13, result) # must match completely, no more and no less; the range is [7, 10)
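
One detail that the pos argument makes easy to miss: passing pos is not the same as slicing the string. The sketch below uses a shortened sample string (an assumption for illustration) to show that ^ still refers to the real start of the string unless re.M is set.

```python
import re

s = "bottle\nbag"
regex = re.compile(r"^b\w+")

# Without re.M, ^ only matches at the real start of the string,
# so starting the search at index 7 finds nothing.
print(regex.search(s, 7))  # None

# Slicing creates a new string whose start is index 0, so ^ matches.
sliced = regex.search(s[7:])
print(sliced.group())  # bag

# With re.M, ^ also matches right after a newline.
multi = re.compile(r"^b\w+", re.M).search(s, 7)
print(multi.group(), multi.span())  # bag (7, 10)
```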

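3.3. Searching the whole text [findall, finditer]

3.3.1. Syntax

re.findall(pattern, string, flags=0)
regex.findall(string[, pos[, endpos]])
Matches across the whole string from left to right and returns a list of all matches.


re.finditer(pattern, string, flags=0)
regex.finditer(string[, pos[, endpos]])
Matches across the whole string from left to right and returns all matches as an iterator.
Note that each iteration yields a match object.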

3.3.2. Examples

import re

s = '''bottle\nbag\nbig\napple'''
# print every character with its index, ten pairs per line
for i,c in enumerate(s, 1):
    print((i-1, c), end='\n' if i%10==0 else ' ')
print()

# findall
result = re.findall('b', s)
print(1, result)
regex = re.compile('^b')
result = regex.findall(s)
print(2, result)
regex = re.compile('^b', re.M)
result = regex.findall(s, 7)
print(3, result) # bag big
regex = re.compile('^b', re.S)
result = regex.findall(s)
print(4, result) # bottle
regex = re.compile('^b', re.M)
result = regex.findall(s, 7, 10)
print(5, result) # bag

# finditer
regex = re.compile(r'^b\w+', re.M) # raw string, so \w reaches the regex engine unchanged
result = regex.finditer(s)
print(type(result))
r = next(result)
print(type(r), r) # a Match object
print(r.start(), r.end(), s[r.start():r.end()])
r = next(result)
print(type(r), r)
print(r.start(), r.end(), s[r.start():r.end()])
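
Instead of calling next by hand, the iterator is usually consumed in a loop or a comprehension. A minimal sketch, reusing the same sample string; the name `hits` is illustrative only.

```python
import re

s = "bottle\nbag\nbig\napple"

# Each item produced by finditer is a match object, so the position
# of every match is available along with its text.
hits = [(m.start(), m.end(), m.group()) for m in re.finditer(r"^b\w+", s, re.M)]
print(hits)  # [(0, 6, 'bottle'), (7, 10, 'bag'), (11, 14, 'big')]

# findall only returns the matched text.
texts = re.findall(r"^b\w+", s, re.M)
print(texts)  # ['bottle', 'bag', 'big']
```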

3.4. Match and replace [sub, subn]

3.4.1. Syntax

re.sub(pattern, repl, string, count=0, flags=0)
regex.sub(repl, string, count=0)
Matches pattern against string and replaces each match with repl. count=0 means replace all matches.
repl can be a str, bytes, or a function. A function is called with each match object and returns the replacement.

re.subn(pattern, repl, string, count=0, flags=0)
regex.subn(repl, string, count=0)
Same as sub, but returns a tuple (new_string, number_of_subs_made).

3.4.2. Examples

import re

s = '''bottle\nbag\nbig\napple'''
# print every character with its index, ten pairs per line
for i, c in enumerate(s, 1):
    print((i - 1, c), end='\n' if i % 10 == 0 else ' ')
print()

# replacing
regex = re.compile(r'b\wg')
result = regex.sub('test', s)
print(1, result)  # the string after replacement
result = regex.sub('test', s, 1)  # replace only once
print(2, result)  # the string after replacement
regex = re.compile(r'\s+')
result = regex.subn('\t', s)
print(3, result)  # a tuple of the new string and the number of replacements
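
The examples above only use a string as the replacement. As a sketch of the function form mentioned in the syntax section, the helper `shout` below is a hypothetical name, and the second call shows group references inside a replacement string.

```python
import re

s = "bottle\nbag\nbig\napple"

def shout(match):
    # Called once per match; the return value replaces the matched text.
    return match.group().upper()

result = re.sub(r"b\wg", shout, s)
print(repr(result))  # 'bottle\nBAG\nBIG\napple'

# A replacement string can refer to groups with \1, \2, ... or \g<name>.
swapped = re.sub(r"(\w+)@(\w+)", r"\2@\1", "user@host")
print(swapped)  # host@user
```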

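3.5. Groups [group]

3.5.1. Syntax

Data captured by the parenthesized parts of a pattern is placed into groups.
match and search return a match object; findall returns a list of strings (or of groups, when the pattern contains groups); finditer yields match objects one by one.
If the pattern uses groups and there is a match, the groups are available on the match object:
1. group(N) returns the corresponding group. 1 to N are the groups in order, 0 returns the whole matched string, and N defaults to 0 when omitted
2. If named groups are used, group('name') returns the group with that name
3. groups() returns all groups
4. groupdict() returns all named groups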

3.5.2. Examples

import re

s = '''bottle\nbag\nbig\napple'''
# print every character with its index, ten pairs per line
for i, c in enumerate(s, 1):
    print((i - 1, c), end='\n' if i % 10 == 0 else ' ')
print()

# groups
regex = re.compile(r'(b\w+)')
result = regex.match(s) # match once from the start
print(type(result))
print(1, 'match', result.group(), result.group(0), result[0],result.groups())
result = regex.search(s, 1) # match once, scanning forward from the given position
print(2, 'search', result.groups())
# named groups
regex = re.compile(r'(b\w+)\n(?P<name2>b\w+)\n(?P<name3>b\w+)')
result = regex.match(s)
print(3, 'match', result)
print(4, result.group(3), result.group(2), result.group(1))
print(5, result.group(0).encode()) # 0 returns the whole matched string
print(6, result.group('name2'), result.group('name3'))
print(6, result.groups())
print(7, result.groupdict())
result = regex.findall(s) # one item: a tuple of the three groups

for x in result: # with groups, findall returns the groups instead of the whole match
    print(type(x), x)

regex = re.compile(r'(?P<head>b\w+)')
result = regex.finditer(s)
for x in result:
    print(type(x), x, x.group(), x.group('head'), x['head'], x[0])
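
A compact sketch of the four access styles listed in 3.5.1 on a more typical input. The date pattern, the group names, and the sample strings are assumptions for illustration.

```python
import re

regex = re.compile(r"(?P<year>\d{4})-(?P<month>\d{2})-(?P<day>\d{2})")
m = regex.search("posted 2023-07-11 10:58")

print(m.group(0))             # 2023-07-11
print(m.group("year"), m[2])  # 2023 07
print(m.groups())             # ('2023', '07', '11')
print(m.groupdict())          # {'year': '2023', 'month': '07', 'day': '11'}

# With more than one group, findall returns tuples of groups.
pairs = re.findall(r"(\w+)=(\d+)", "a=1, b=22")
print(pairs)  # [('a', '1'), ('b', '22')]
```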

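3.6. Splitting strings

3.6.1. Syntax

The string method split is awkward to use here: it cannot split on several different characters at once.
re.split(pattern, string, maxsplit=0, flags=0)
re.split splits the string wherever pattern matches. maxsplit=0 means no limit on the number of splits.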

3.6.2. Examples

import re

s = """
os.path.abspath(path) 
normpath(join(os.getcwd(), path)).
"""
# extract the words on each line
print(s.split())  # str.split cannot do it: ['os.path.abspath(path)', 'normpath(join(os.getcwd(),', 'path)).']
print(re.split(r'[\.()\s,]+', s))  # the result starts and ends with an empty string
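
Two further behaviours of re.split are worth a quick sketch: maxsplit, and capturing groups in the pattern, which keep the delimiters in the result. The sample strings below are made up for illustration.

```python
import re

s = "a, b;c  d"

parts = re.split(r"[,;\s]+", s)
print(parts)  # ['a', 'b', 'c', 'd']

# maxsplit limits the number of splits; the rest stays in one piece.
first_two = re.split(r"[,;\s]+", s, maxsplit=1)
print(first_two)  # ['a', 'b;c  d']

# A capturing group keeps the delimiters in the result.
with_delims = re.split(r"([,;])", "a,b;c")
print(with_delims)  # ['a', ',', 'b', ';', 'c']
```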

posted @ 2023-07-11 10:58 小粉优化大师
