Post archive for February 26, 2024 - 会秃头的小白

February 26, 2024

Summary: from lxml import etree import requests # scrape all city names if __name__ == '__main__': url = 'https://www.aqistudy.cn/historydata/' headers = { 'User-Agent':'Moz

posted @ 2024-02-26 21:23 会秃头的小白 Views (13) Comments (0) Recommendations (0)
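The summary above is cut off before the parsing step, so here is only a rough sketch of how city names could be pulled out of a page with lxml once the source has been fetched. The markup in `SAMPLE_HTML`, the class names `hot` and `all`, and the helper `extract_city_names` are all assumptions made up for illustration; they are not taken from the post or from the real page at aqistudy.cn.

```python
from lxml import etree

# Hypothetical markup standing in for the fetched page source.
# The real page structure is not shown in the post, so this is an assumption.
SAMPLE_HTML = """
<div class="hot"><ul>
  <li><a href="#">Beijing</a></li>
  <li><a href="#">Shanghai</a></li>
</ul></div>
<div class="all"><ul>
  <li><a href="#">Chengdu</a></li>
  <li><a href="#">Beijing</a></li>
</ul></div>
"""


def extract_city_names(page_text):
    """Return the city names found in page_text, without duplicates."""
    tree = etree.HTML(page_text)
    # The | operator unions two XPath expressions into a single query,
    # so both lists are collected in one call.
    names = tree.xpath(
        '//div[@class="hot"]//li/a/text() | //div[@class="all"]//li/a/text()'
    )
    # dict.fromkeys drops repeats while keeping first-seen order.
    return list(dict.fromkeys(names))


city_names = extract_city_names(SAMPLE_HTML)
print(city_names, len(city_names))
```

With a live page, the string passed to `extract_city_names` would be the `text` of a `requests.get(url, headers=headers)` response, and the XPath expressions would need adjusting to whatever the page actually contains.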

Scraping image data from 彼岸图库

Summary: from lxml import etree import requests import os # scrape image data from 彼岸图库 if __name__ == '__main__': # fetch the page source url = 'https://pic.netbian.com/4kmeinv/' headers

posted @ 2024-02-26 18:37 会秃头的小白 Views (10) Comments (0) Recommendations (0)

Scraping 58 second-hand housing data

Summary: from lxml import etree import requests # scrape 58 second-hand housing listings if __name__ == '__main__': # fetch the page source url = 'https://m.58.com/bj/ershoufang/?reform=pcfront&PGTID=0d0

posted @ 2024-02-26 18:03 会秃头的小白 Views (23) Comments (0) Recommendations (0)

xpath

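Summary: Notes. How xpath parsing works: - Data parsing principle: - 1. Instantiate an etree object and load the page source data into it - 2. Call the xpath method of the etree object with an xpath expression to extract the data - Environment setup: - pip install lxml - Instantiating an etree object: from lxml impo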

posted @ 2024-02-26 17:16 会秃头的小白 Views (7) Comments (0) Recommendations (0)
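The two steps from the xpath notes can be sketched as follows. This is a minimal example, assuming lxml is installed; the page source in `SAMPLE_HTML` is made up for illustration and does not come from the post.

```python
from lxml import etree

# Hypothetical page source, invented for this sketch.
SAMPLE_HTML = """
<html>
  <body>
    <div class="song">
      <p>first</p>
      <p>second</p>
    </div>
    <a href="https://example.com/a">link A</a>
  </body>
</html>
"""

# Step 1: instantiate an etree object and load the page source into it.
tree = etree.HTML(SAMPLE_HTML)

# Step 2: call xpath() with an xpath expression to extract data.
# text() selects the text nodes, @href selects the attribute value.
paragraphs = tree.xpath('//div[@class="song"]/p/text()')
hrefs = tree.xpath('//a/@href')

print(paragraphs)  # ['first', 'second']
print(hrefs)       # ['https://example.com/a']
```

`etree.HTML` parses a string that is already in memory; for an HTML file saved on disk, `etree.parse(path, etree.HTMLParser())` is the usual alternative.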
