Scraping images from a website with Python and saving them to a local folder

Target website

https://wallpaperscraft.com/catalog/anime

Scraper code

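# Imports
import os
import requests
from parsel import Selector

def download_onepagephoto(website_url, count):  # Download every image on one catalog page
    # Keep the running image number passed in by the caller
    i = count
    # Make sure the output folder exists; open() below fails if it is missing
    os.makedirs("photo", exist_ok=True)
    # Request the catalog page (the timeout keeps a stalled connection from hanging forever)
    response = requests.get(website_url, timeout=30)
    response.encoding = response.apparent_encoding
    # The key step: build a Selector object from the page HTML
    sel = Selector(response.text)
    # Collect the href of every <a> tag inside elements with the class wallpapers__item
    index = sel.css('.wallpapers__item a::attr(href)').getall()
    # Visit the detail page of each image
    for line in index:
        # Fetch the detail page and parse it the same way as above
        response = requests.get("https://wallpaperscraft.com" + line, timeout=30)
        response.encoding = response.apparent_encoding
        sel = Selector(response.text)
        index2 = sel.css('.wallpaper__placeholder a::attr(href)').getall()
        if len(index2) != 0:
            nameurl = index2[0]
            # Download the image and save it to the photo folder next to the script
            photo = requests.get(nameurl, timeout=30).content
            with open("photo/" + str(i) + ".jpg", "wb") as fp:
                fp.write(photo)
            print(str(i) + " saved successfully")
            i = i + 1
    # Return the next unused image number so the caller can keep counting
    return i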
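
The function above builds each detail-page URL by string concatenation and always saves files with a .jpg extension. As a minimal sketch of a slightly more defensive variant, the two helpers below resolve links with urljoin and take the extension from the image URL itself. The names BASE_URL, absolute_url and local_path are hypothetical and not part of the original script, and the example.com addresses are placeholders only.

```python
import os
from urllib.parse import urljoin, urlparse

# Assumption: the site root used in the script above
BASE_URL = "https://wallpaperscraft.com"


def absolute_url(href, base=BASE_URL):
    """Resolve an href from the page against the site root.

    Relative links such as "/wallpaper/foo" get the base prepended;
    links that are already absolute are returned unchanged.
    """
    return urljoin(base, href)


def local_path(image_url, index, folder="photo"):
    """Build the local file path for the image with the given number.

    The extension is taken from the URL path (query string ignored);
    ".jpg" is used as a fallback when the URL has no extension.
    """
    ext = os.path.splitext(urlparse(image_url).path)[1] or ".jpg"
    return os.path.join(folder, f"{index}{ext}")
```

Inside the loop, absolute_url(line) could stand in for "https://wallpaperscraft.com"+line, and local_path(nameurl, i) for the hard-coded "photo/"+str(i)+".jpg".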

count = 1
# Scrape the first page
count = download_onepagephoto("https://wallpaperscraft.com/catalog/anime/1920x1080", count)
# Scrape page 2 through page 173
for temp in range(2, 174):
    count = download_onepagephoto("https://wallpaperscraft.com/catalog/anime/1920x1080/page" + str(temp), count)
    print("Finished scraping page " + str(temp))
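
The driver code treats page 1 separately because its URL has no /pageN suffix. A small sketch of how that rule could be folded into one helper, so a single loop covers every page, is shown below. CATALOG_URL, page_url and page_urls are hypothetical names, and the default last page of 173 simply mirrors range(2,174) from the script; the real page count on the site may differ.

```python
# Assumption: the catalog address used in the script above
CATALOG_URL = "https://wallpaperscraft.com/catalog/anime/1920x1080"


def page_url(page):
    """Return the catalog URL for a 1-based page number.

    Page 1 is the bare catalog URL; later pages append "/page<N>".
    """
    if page < 1:
        raise ValueError("page numbers start at 1")
    if page == 1:
        return CATALOG_URL
    return f"{CATALOG_URL}/page{page}"


def page_urls(first=1, last=173):
    """List the URLs for pages first..last, inclusive."""
    return [page_url(p) for p in range(first, last + 1)]
```

With this in place, the two-step driver could become a single loop: for url in page_urls(): count = download_onepagephoto(url, count).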


Posted 2022-04-29 08:55 by 东血
