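[Python] Reposting a forum thread about a Python crawler
Original thread
Original title: Crawling 4K images from 图网 and saving them locally automatically
https://www.52pojie.cn/thread-1809600-1-1.html
(Source: 吾爱破解 forum)
Python code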
import os
import random
import time
import webbrowser

import requests
from lxml import etree

# Open the site so the user can pick a category URL in the browser.
webbrowser.open('https://pic.netbian.com')
ua = input("Enter your browser's User-Agent (search Baidu if you don't know how to find it)\n")
print('''Examples:
https://pic.netbian.com/4kmeinv
https://pic.netbian.com/4kfengjing
https://pic.netbian.com
''')
uuuu = input("Enter the 彼岸图 URL, without a trailing '/'\n")
headers = {
    'User-Agent': ua
}
url = uuuu


def huoqu(urll):
    """Download every image listed on one page into the 彼岸图网 folder."""
    respones = requests.get(urll, headers=headers)
    mg = respones.content.decode("gbk")  # the page is decoded as GBK
    tree = etree.HTML(mg)
    img_url = tree.xpath('//ul[@class="clearfix"]//li')
    # Create the output folder once, before the download loop.
    os.makedirs('彼岸图网', exist_ok=True)
    for a in img_url:
        c = a.xpath('./a//img/@alt')[0]      # image title, used as the file name
        img_mg = a.xpath('./a//img/@src')    # image path relative to the site root
        # Send the same headers for the image request as for the page request.
        img_mgg = requests.get('https://pic.netbian.com' + ''.join(img_mg), headers=headers)
        with open(f'彼岸图网/{c}.jpg', 'wb') as f:
            f.write(img_mgg.content)
        print(f'彼岸图网/{c}.jpg saved')


def zongpage(url):
    """Read the total page count from the pagination bar."""
    print(url)
    respones = requests.get(url, headers=headers)
    mg = respones.content.decode("gbk")
    tree = etree.HTML(mg)
    page = tree.xpath('//div[@class="page"]/a/text()')
    zongpage = page[-2]  # second-to-last pagination link, taken as the last page number
    print('Total: ' + zongpage + ' pages')
    return zongpage


zongpagee = zongpage(url)
# range() excludes its end value, so add 1 to include the last page.
for a in range(1, int(zongpagee) + 1):
    if a == 1:
        huoqu(url)
    else:
        uu = f'{url}/index_{a}.html'
        print(uu)
        huoqu(uu)
    # Pause a few seconds between pages.
    time.sleep(random.randint(2, 5))
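The script builds page URLs inline and uses each image's alt text directly as a file name, which can fail if the title contains characters such as `/` or `?`. The sketch below pulls those two steps into small pure functions that can be checked without touching the network. The names `page_urls` and `safe_filename` are hypothetical and do not appear in the original thread. The URL scheme (page 1 is the bare category URL, page n is `index_n.html`) is taken from the script above.

```python
import re


def page_urls(base_url, total_pages):
    """Return the listing-page URLs for pages 1..total_pages of one category."""
    base_url = base_url.rstrip('/')
    urls = [base_url]  # page 1 is the bare category URL
    for n in range(2, total_pages + 1):
        urls.append(f'{base_url}/index_{n}.html')
    return urls


def safe_filename(title, fallback='untitled'):
    """Replace characters that are not allowed in file names with '_'."""
    cleaned = re.sub(r'[\\/:*?"<>|\s]+', '_', title).strip('_')
    return cleaned or fallback


if __name__ == '__main__':
    for u in page_urls('https://pic.netbian.com/4kfengjing', 3):
        print(u)
    print(safe_filename('a/b: c?'))
```

In the script, `huoqu` could then be called on each entry of `page_urls(url, int(zongpagee))`, and `safe_filename(c)` could replace `c` when building the output path.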
This article is from 博客园 (cnblogs), author: FE-有限元鹰. When reposting, please cite the original link: https://www.cnblogs.com/aksoam/p/17576919.html