April 2017 Post Archive - 睚一

Scraping 小猪短租 (Xiaozhu short-term rentals) with Python, using the lxml parser

Summary:
    from bs4 import BeautifulSoup
    import requests
    link_list = []
    def get_soup(url):
        # Fetch the page's HTML and parse it into a soup with BeautifulSoup
        html = requests.get(url)
        soup = BeautifulSoup(html.text, 'lxml')
        ...

posted @ 2017-04-30 00:34 睚一 Views(271) Comments(0) Recommendations(0)
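
The excerpt above stops right after the soup is built, so what follows is only a minimal sketch of how a get_soup helper might be rounded out, not the post's actual code. The timeout, the raise_for_status() call, the collect_links helper, and the "a[href]" selector are all assumptions added for illustration; the real page markup is not shown in the excerpt.

```python
from urllib.parse import urljoin


def get_soup(url, timeout=10):
    """Fetch a page and parse it into a soup with the lxml parser."""
    # Third-party packages (requests, beautifulsoup4, lxml) are imported here
    # so that collect_links below stays usable without them installed.
    import requests
    from bs4 import BeautifulSoup

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # stop on 4xx/5xx instead of parsing an error page
    return BeautifulSoup(response.text, "lxml")


def collect_links(soup, base_url, selector="a[href]"):
    """Return absolute, de-duplicated URLs for tags matching a CSS selector.

    The selector is a placeholder; it has to be adapted to the real page.
    """
    link_list = []
    for tag in soup.select(selector):
        absolute = urljoin(base_url, tag["href"])
        if absolute not in link_list:
            link_list.append(absolute)
    return link_list
```

A typical call would be collect_links(get_soup(url), url). Resolving every href against the page URL keeps relative links usable when they are requested later.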

Scraping news from 腾讯房产 (Tencent Real Estate) with Python, using the libraries requests, re, time, and BeautifulSoup

Summary:
    import requests
    import re
    import time
    from bs4 import BeautifulSoup
    today = time.strftime('%Y-%m-%d', time.localtime(time.time()))
    one_url = 'http://hz.house.qq.com'  # base link used to build new URLs
    url = 'http://...

posted @ 2017-04-26 12:01 睚一 Views(319) Comments(0) Recommendations(0)
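
The excerpt computes today's date and keeps a base URL for building links, but the filtering step is cut off. The sketch below shows one way those two pieces could fit together. The list markup matched by ITEM_PATTERN and the names today_string and news_for_date are hypothetical, since the real page structure does not appear in the excerpt.

```python
import re
import time
from urllib.parse import urljoin

one_url = 'http://hz.house.qq.com'  # base URL, taken from the excerpt

# Hypothetical list markup: <a href="...">title</a> <span>YYYY-MM-DD</span>
ITEM_PATTERN = re.compile(
    r'<a href="([^"]+)"[^>]*>([^<]+)</a>\s*<span[^>]*>(\d{4}-\d{2}-\d{2})</span>'
)


def today_string(now=None):
    """Return a date as YYYY-MM-DD; defaults to the current local date."""
    if now is None:
        now = time.localtime()
    return time.strftime('%Y-%m-%d', now)


def news_for_date(html_text, date_string, base_url=one_url):
    """Return (title, absolute_url) pairs whose date equals date_string."""
    results = []
    for href, title, date in ITEM_PATTERN.findall(html_text):
        if date == date_string:
            # Relative hrefs are resolved against the base URL.
            results.append((title.strip(), urljoin(base_url, href)))
    return results
```

Calling news_for_date(page_html, today_string()) would keep only items dated today. Passing the date in as an argument, rather than reading the clock inside the function, makes the filter easy to check against a saved page.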

Scraping data from 金10 (Jin10) and writing it to an Excel spreadsheet (re, requests, xlwt)

Summary:
    import requests
    import re
    import xlwt
    def Get_news():
        url = 'https://www.jin10.com/'
        html = requests.get(url)
        html.encoding = html.apparent_encoding
        r

posted @ 2017-04-25 16:22 睚一 Views(428) Comments(0) Recommendations(0)
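
The excerpt sets the response encoding from apparent_encoding and then breaks off before the regular-expression step. As a rough sketch of what such a step could look like, the code below pulls (time, text) pairs out of the decoded HTML. NEWS_PATTERN, fetch_text, and parse_news are hypothetical names, and the markup they assume is invented for illustration and will not match the live site.

```python
import re

# Hypothetical markup for one news item; the real page will differ.
NEWS_PATTERN = re.compile(
    r'<span class="time">(.*?)</span>\s*<p class="text">(.*?)</p>', re.S
)
TAG_PATTERN = re.compile(r'<[^>]+>')


def fetch_text(url, timeout=10):
    """Download a page and decode it with the encoding guessed from its bytes."""
    import requests  # third-party; imported here so parse_news works without it

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    # Prefer the encoding detected from the body over the header default,
    # which avoids mojibake on pages that do not declare a charset.
    response.encoding = response.apparent_encoding
    return response.text


def parse_news(html_text):
    """Return (time, text) tuples with inline tags stripped from the text."""
    news = []
    for stamp, body in NEWS_PATTERN.findall(html_text):
        news.append((stamp.strip(), TAG_PATTERN.sub('', body).strip()))
    return news
```

Keeping the download and the parsing in separate functions means the pattern can be tuned against a saved copy of the page without sending repeated requests.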

Scraping MV information from 音悦台 (YinYueTai) (requests, BeautifulSoup, xlwt) ---- to be improved

Summary:
    import requests
    from bs4 import BeautifulSoup
    import xlwt  # library for writing Excel files
    def excel_write(MV_list):
        newtable = 'MV.xls'  # name of the Excel file to create
        wb = xlwt.Workbook(encoding = 'utf-8')  # create Ex...

posted @ 2017-04-24 16:50 睚一 Views(223) Comments(0) Recommendations(0)
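
The excel_write excerpt ends just after the workbook is created. Below is a minimal sketch of how the remaining steps might look with xlwt. The shape of MV_list is not visible in the excerpt, so the sketch assumes a list of dicts with the hypothetical keys "title", "artist", and "url"; the mv_rows helper and the sheet name "MV" are likewise assumptions.

```python
def mv_rows(mv_list, headers=("title", "artist", "url")):
    """Turn a list of dicts into a header row followed by one row per MV."""
    rows = [list(headers)]
    for mv in mv_list:
        # Missing keys become empty cells instead of raising KeyError.
        rows.append([mv.get(key, "") for key in headers])
    return rows


def excel_write(mv_list, newtable="MV.xls"):
    """Write the MV rows to an .xls file and return its name."""
    import xlwt  # third-party; imported here so mv_rows works without it

    wb = xlwt.Workbook(encoding="utf-8")
    ws = wb.add_sheet("MV")
    for row_index, row in enumerate(mv_rows(mv_list)):
        for col_index, value in enumerate(row):
            ws.write(row_index, col_index, value)
    wb.save(newtable)
    return newtable
```

Building the rows in a separate function keeps the cell-writing loop simple and lets the row layout be checked without creating a file.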
