蓝勃斐重新开始

2018年5月2日

摘要：已经爬取到的斗破苍穹文本以TXT形式存储代码结果阅读全文

posted @ 2018-05-02 14:33 蓝勃斐重新开始阅读(374) 评论(0) 推荐(0) 编辑

2018年4月23日

摘要： from selenium import webdriver url='https://www.jianshu.com/p/a64529b4ccf3' def get_info(url): include_title=[] driver=webdriver.PhantomJS() driver.get(url) driver.implicitl... 阅读全文

posted @ 2018-04-23 10:34 蓝勃斐重新开始阅读(3060) 评论(0) 推荐(0) 编辑

2018年4月20日

从照片网站pexels批量爬取照片

摘要：调试中，未成功。阅读全文

posted @ 2018-04-20 17:21 蓝勃斐重新开始阅读(997) 评论(0) 推荐(0) 编辑

2018年4月19日

爬取豆瓣电影top250并存储到mysql数据库

摘要： import requests from lxml import etree import re import pymysql import time conn= pymysql.connect(host='localhost',user='root',passwd='root',db='mydb' 阅读全文

posted @ 2018-04-19 18:20 蓝勃斐重新开始阅读(681) 评论(0) 推荐(0) 编辑

2018年4月18日

爬取起点中文网小说介绍信息

摘要：字数的信息（word）没有得到缺失阅读全文

posted @ 2018-04-18 17:03 蓝勃斐重新开始阅读(222) 评论(0) 推荐(0) 编辑

2018年4月17日

爬取嗅事百科的段子

摘要：爬取正文（contents）时，需要转码。结果：姓名：niangaoni… 等级：23性别：男刚才看了一篇叫《抖音，快手正在毁掉我们的下一代！！》的文章，我才知道现在的00后，10后后那么逆天！我一个95后经常被他们喊着大叔的人，真的是经常被他们一些行为和语言所震惊！我从来不反感任何一个app 阅读全文

posted @ 2018-04-17 18:01 蓝勃斐重新开始阅读(327) 评论(0) 推荐(0) 编辑

2018年4月16日

爬去酷狗top500的数据

摘要： import requests from bs4 import BeautifulSoup import time headers={ #'User-Agent':'Nokia6600/1.0 (3.42.1) SymbianOS/7.0s Series60/2.0 Profile/MIDP-2.0 Configuration/CLDC-1.0' 'User-Agent... 阅读全文

posted @ 2018-04-16 15:24 蓝勃斐重新开始阅读(146) 评论(0) 推荐(0) 编辑

BeautifulSoup库测试代码

摘要： import requests from bs4 import BeautifulSoup import time headers={ #'User-Agent':'Nokia6600/1.0 (3.42.1) SymbianOS/7.0s Series60/2.0 Profile/MIDP-2.0 Configuration/CLDC-1.0' 'User-Agent... 阅读全文

posted @ 2018-04-16 11:05 蓝勃斐重新开始阅读(146) 评论(0) 推荐(0) 编辑

2018年4月13日

爬去豆瓣图书top250数据存储到csv中

摘要： from lxml import etree import requests import csv fp=open('C://Users/Administrator/Desktop/lianxi/doubanbook.csv','w+',newline='',encoding='utf-8') writer=csv.writer(fp) writer.writerow(('name','url'... 阅读全文

posted @ 2018-04-13 14:48 蓝勃斐重新开始阅读(286) 评论(0) 推荐(0) 编辑

2018年4月9日

python爬虫之路——对斗破苍穹进行关键字提取，制作噪声云图

摘要：对贴吧也可以进行同样操作阅读全文

posted @ 2018-04-09 10:11 蓝勃斐重新开始阅读(202) 评论(0) 推荐(0) 编辑

公告