04 2017 档案

摘要:from bs4 import BeautifulSoup import requests link_list = [] def get_soup(url): #获取网页的HTML文件,并用BeautifulSoup做成soup html = requests.get(url) soup = BeautifulSoup(html.text,'lxml') ... 阅读全文
posted @ 2017-04-30 00:34 睚一 阅读(271) 评论(0) 推荐(0) 编辑
摘要:import requests import re import time from bs4 import BeautifulSoup today = time.strftime('%Y-%m-%d',time.localtime(time.time())) one_url = 'http://hz.house.qq.com' #用来构建新的URL的链接 url = 'http://... 阅读全文
posted @ 2017-04-26 12:01 睚一 阅读(319) 评论(0) 推荐(0) 编辑
摘要:import requests import re import xlwt def Get_news(): url = 'https://www.jin10.com/' html = requests.get(url) html.encoding = html.apparent_encoding r 阅读全文
posted @ 2017-04-25 16:22 睚一 阅读(428) 评论(0) 推荐(0) 编辑
摘要:import requests from bs4 import BeautifulSoup import xlwt #写入Excel的库 def excel_write(MV_list): newtable = 'MV.xls' #创建Excel文件的名称 wb = xlwt.Workbook(encoding = 'utf-8') #创建Ex... 阅读全文
posted @ 2017-04-24 16:50 睚一 阅读(223) 评论(0) 推荐(0) 编辑