re模块常用方法 - 博客

[{"createTime":1735734952000,"id":1,"img":"bandupan_350_218.jpg","link":"https://pan.baidu.com/s/1T03izdWtRSeMqOXoT9HCug?pwd=draw","name":"百度网盘下载","status":9,"txt":"百度网盘下载","type":1,"updateTime":1735747411000,"userId":3},{"createTime":1736173885000,"id":2,"img":"txy_480_300.png","link":"https://cloud.tencent.com/act/cps/redirect?redirect=1077&cps_key=edb15096bfff75effaaa8c8bb66138bd&from=console","name":"腾讯云秒杀","status":9,"txt":"腾讯云限量秒杀","type":1,"updateTime":1736173885000,"userId":3},{"createTime":1736177492000,"id":3,"img":"aly_251_140.png","link":"https://www.aliyun.com/minisite/goods?userCode=pwp8kmv3","memo":"","name":"阿里云","status":9,"txt":"阿里云2折起","type":1,"updateTime":1736177492000,"userId":3},{"createTime":1735660800000,"id":4,"img":"vultr_560_300.png","link":"https://www.vultr.com/?ref=9603742-8H","name":"Vultr","status":9,"txt":"Vultr送$100","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":5,"img":"jdy_663_320.jpg","link":"https://3.cn/2ay1-e5t","name":"京东云","status":9,"txt":"京东云特惠专区","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":6,"img":"qk_443_300.png","link":"https://pan.quark.cn/s/6229b93c70d0","name":"夸克网盘","status":9,"txt":"夸克网盘","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":7,"img":"yun_910_50.png","link":"https://activity.huaweicloud.com/discount_area_v5/index.html?fromacct=261f35b6-af54-4511-a2ca-910fa15905d1&utm_source=aXhpYW95YW5nOA===&utm_medium=cps&utm_campaign=201905","name":"底部","status":9,"txt":"高性能云服务器2折起","type":2,"updateTime":1735660800000,"userId":3}]

<>re模块常用方法

*
正则表达式，又称规则表达式。（英语：Regular
Expression，在代码中常简写为regex、regexp或RE），计算机科学的一个概念。正则表达式通常被用来检索、替换那些符合某个模式(规则)的文本。

*
给定一个正则表达式和另一个字符串，我们可以达到如下的目的：
给定的字符串是否符合正则表达式的过滤逻辑（称作“匹配”）；
可以通过正则表达式，从字符串中获取我们想要的特定部分。

*
正则表达式的特点是：
灵活性、逻辑性和功能性非常强；
可以迅速地用极简单的方式达到字符串的复杂控制；
对于刚接触的人来说，比较晦涩难懂。
re模块操作
在Python中通过re模块来完成正则表达式操作

match(string[, pos[, endpos]])
string 是待匹配的字符串 pos和 endpos 可选参数，指定字符串的起始和终点位置，默认值分别是 0和 len(字符串长度)。
# match 方法：从起始位置开始查找，一次匹配 re.match(pattern, string, flags=0) result =
re.match("hello", "hellolzt world") print(result, result.group(), type(result))
在字符串开头匹配pattern，如果匹配成功（可以是空字符串）返回对应的match对象,否则返回None。

<>search 方法

* 查找字符串的任何位置，只匹配一次，只要找到了一个匹配的结果就返回
search(string[, pos[, endpos]]) ,string是待匹配的字符串 pos 和 endpos
可选参数，指定字符串的起始和终点位置。当匹配成功时，返回一个Match 对象，如果没有匹配上，则返回 None。扫描整个字符串string，找到与正则表达式
pattern的第一个匹配（可以是空字符串），并返回一个对应的match对象。如果没有匹配返回None. re.search(pattern, string,
flags=0) result = re.search("hello", "2018hellolzt world") print(result.group())
<>fullmatch方法

* fullmatch(pattern, string, flags=0)，是match函数的完全匹配（从字符串开头到结尾）
re.fullmatch(pattern, string, flags=0) result = re.fullmatch("hello", "hello1")
print(result)
string是否整个和pattern匹配，如果是返回对应的match对象,否则返回None。

<>findall方法

* 以列表形式返回全部能匹配的子串，如果没有匹配，则返回一个空列表。 findall(string[, pos[, endpos]]),string
待匹配的字符串pos 和 endpos 可选参数，指定字符串的起始和终点位置。 findall(pattern, string, flags=0)
result = re.findall("hello", "lzt hello china hello world") print(result,
type(result)) # 返回列表
<>split方法

* 按照能够匹配的子串将字符串分割后返回列表 split(string[, maxsplit]),maxsplit用于指定最大分割次数，不指定将全部分割。
re.split(pattern, string, maxsplit=0, flags=0) result = re.split("hello",
"hello china hello world", 2) print(result, type(result)) # 返回分割列表
<>sub方法

* 用于替换,sub(repl, string[, count]),epl可以是字符串也可以是一个函数：
(1) 如果repl 是字符串，则会使用 repl去替换字符串每一个匹配的子串
(2) 如果repl 是函数，方法只接受一个参数（Match对象），并返回一个字符串用于替换。
(3) count 用于指定最多替换次数，不指定时全部替换。 sub(pattern, repl, string, count=0, flags=0)
result = re.sub("hello", "hi", "hello china hello world", 2) print(result,
type(result))
使用repl替换pattern匹配到的内容，最多匹配count次

<>iterator方法
finditer(pattern, string, flags=0) result = re.finditer("hello", "hello world
hello china") print(result, type(result)) # 返回迭代器
<>compile方法

* compile 函数用于编译正则表达式，生成一个 Pattern 对象 compile(pattern, flags=0) pat =
re.compile("hello") print(pat, type(pat)) result = pat.search("helloworld")
print(result, type(result)) # 编译得到匹配模型
<>flags

* re模块的一些函数中将flags作为可选参数，下面列出了常用的几个flag, 它们实际对应的是二进制数，可以通过位或将他们组合使用。flags
可能改变正则表达时的行为：
re.I re.IGNORECASE: 匹配中大小写不敏感
re.M re.MULTILINE: “^“匹配字符串开始以及”\n"之后；”$“匹配”\n"之前以及字符串末尾。通常称为多行模式
re.S re.DOTALL: "."匹配任意字符，包括换行符。通常称为单行模式
如果要同时使用单行模式和多行模式，只需要将函数的可选参数flags设置为re.I| re.S即可。 result = re.match("hello",
"HeLlo", flags=re.I) print(result) result =
re.findall("^abc","abcde\nabcd",re.M) print(result) result =
re.findall("e$","abcde\nabcd",re.M) print(result) result = re.findall(".",
"hello \n china", flags=re.S) # "." 可以匹配换行符 print(result) result =
re.findall(".", "hello \n china", flags=re.M) # "." 不可以匹配换行符 print(result)

技术

Java1212 篇
Python927 篇
开发语言608 篇
c语言463 篇
算法461 篇
MySQL438 篇
数据库394 篇
前端387 篇
更多...