python爬取网站指定数据并存入excel

1:安装库

pip install beautifulsoup4
pip install pandas

2:爬取数据

我们拿 https://cuiqingcai.com/archives/ 网站为例子,来进行爬取文章标题

import requests
from bs4 import BeautifulSoup
import pandas as pd
import openpyxl

# 请求网页数据
res = requests.get("https://cuiqingcai.com/archives/")
soup = BeautifulSoup(res.text, "html.parser")

# 爬取数据
data = []
for div in soup.find_all("div", class_="post-title"):
    data.append(div.text)

# 存入Excel
df = pd.DataFrame(data, columns=["Data"])
df.to_excel("data.xlsx", index=False)

 

posted @ 2023-02-06 15:42  Old·Artist  阅读(416)  评论(0编辑  收藏  举报