Web Scraping using Python Scrapy_BS4 - using BeautifulSoup and Python
Use BeautifulSoup and Python to scrape a website.
Libraries:
- urllib (fetches the page)
- BeautifulSoup (parses the HTML data)
Web scraping script
    from urllib.request import urlopen as uReq
    from bs4 import BeautifulSoup as soup

    quotes_page = "https://bluelimelearning.github.io/my-fav-quotes/"

    # Download the raw HTML of the page, then close the connection
    uClient = uReq(quotes_page)
    page_html = uClient.read()
    uClient.close()

    # Parse the HTML with Python's built-in parser
    page_soup = soup(page_html, "html.parser")

    # Each quote lives in its own <div class="quotes"> block
    quotes = page_soup.find_all("div", {"class": "quotes"})

    for quote in quotes:
        # The quote text is in <p class="aquote">
        fav_quote = quote.find_all("p", {"class": "aquote"})
        aquote = fav_quote[0].text.strip()

        # The author name is in <p class="author">
        fav_authors = quote.find_all("p", {"class": "author"})
        author = fav_authors[0].text.strip()

        print(aquote)
        print(author)
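The parsing step can also be tried offline, without fetching the page. The following is only a minimal sketch: the SAMPLE_HTML string and the extract_quotes helper are hypothetical names introduced for illustration, and the markup is an assumption about the page structure based on the class names used in the script above (quotes, aquote, author). Blocks that are missing either tag are skipped rather than raising an IndexError.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup: assumed to mirror the structure of the real page
# (a <div class="quotes"> holding <p class="aquote"> and <p class="author">).
SAMPLE_HTML = """
<div class="quotes">
  <p class="aquote"> The journey of a thousand miles begins with one step. </p>
  <p class="author"> Lao Tzu </p>
</div>
<div class="quotes">
  <p class="aquote">Imagination is more important than knowledge.</p>
  <p class="author">Albert Einstein</p>
</div>
"""


def extract_quotes(html):
    """Return a list of (quote, author) tuples found in the given HTML."""
    page_soup = BeautifulSoup(html, "html.parser")
    results = []
    for block in page_soup.find_all("div", class_="quotes"):
        quote_tag = block.find("p", class_="aquote")
        author_tag = block.find("p", class_="author")
        # Skip incomplete blocks instead of failing on a missing tag
        if quote_tag is None or author_tag is None:
            continue
        results.append(
            (quote_tag.get_text(strip=True), author_tag.get_text(strip=True))
        )
    return results


if __name__ == "__main__":
    for quote, author in extract_quotes(SAMPLE_HTML):
        print(quote)
        print(author)
```

Returning tuples instead of printing inside the loop keeps the parsing logic reusable, for example if the results are later written to a file.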
The script runs successfully.
The following is the complete output of the scrape.
    I hear and i forget. I see and i remember. I do and i understand.
    Confucious
    Feeling gratitude and not expressing it is like wrapping a present and not giving it.
    William Arthur Ward
    Our greatest glory is not in never falling but in rising every time we fall.
    Confucious
    The secret of getting aheadis getting started.
    Mark Twain
    Believe you can and you're halfway there.
    Theodore Roosevelt
    Resentment is like drinking Poison and waiting for your enemies to die.
    Nelson Mandela
    Silence is a true friend who never betrays.
    Confucius
    The best way to find yourself is to lose yourself in the service of others.
    Mahatma Gandhi
    Never succumb to the temptation of bitterness.
    Martin Luther King Jnr
    The journey of a thousand miles begins with one step.
    Lao Tzu
    It is health that is real wealth and not pieces of gold and silver.
    Mahatma Gandhi
    Yesterday is not ours to recover but tomorrow is ours to win or lose.
    Lyndon B Johnson
    It's not what happens to you but how you react to it that matters .
    Epictetus
    Beware of what you become in pursuit of what you want.
    Jim Rohn
    The best revenge is massive success.
    Frank Sinatra
    Do not take life too seriously You will never get out of it alive.
    Elbert Hubbard
    Don't judge each day by the harvest you reap but by the seeds that yiu plant.
    Robert Loius Stevenson
    Your attitude and not your aptitude will determine your altitude
    Zig Ziglar
    Imagination is more important than knowledge.
    Albert Einstein