Python 库

PySnooper

python的自动debug工具，只需要在函数上用它装饰，运行程序时，就可以知道每一个程序运行的步骤，参数和返回值，在找错误时非常方便，如下使用

import pysnooper
 
@pysnooper.snoop()
def demo_func():
    profile = {}
    profile["name"] = "写代码的明哥"
    profile["age"] = 27
    profile["gender"] = "male"
 
    return profile
 
def main():
    profile = demo_func()
 
main()

2.requests

这个模块是python用来爬取网站时，获取网站数据，用requests就可以轻松获取到数据，上传或者下载超过2g的大文件都可以。

In [1]: import requests  
In [2]: resp = requests.get('http://www.baidu.com')

In [3]: resp.status_code
Out[3]: 200

3.requests-html

这个模块是requests作者写的，也是针对爬虫模块，当我们获取到是一个网页，按照平常我们可能会用lxml解析网页，用这个模块可以省略这个步骤，轻松获取网站的标题，url等

from requests_html import HTMLSession

session = HTMLSession()
r = session.get('https://www.qiushibaike.com/text/')
// 查看页面内容
print(r.html.html)

4.lxml

我们解析xml通常都用xml.dom模块，可是使用特别的繁琐，所来lxml不仅可以解析html，还可以解析xml，真心好用

from lxml import etree#导入lxml库
tree = etree.parse("dblp.xml")#将xml解析为树结构
root = tree.getroot()#获得该树的树根

posted @ 2022-05-23 10:02 飞雪飘鸿阅读(39) 评论(0) 编辑收藏举报

刷新页面返回顶部

整个互联网就是一个巨大的计算机

Python 库

公告