05 2018 档案

requests--etree--xpath

摘要：# -*- coding: cp936 -*- import requests from lxml import etree url = 'https://weibo.cn/pub/' html = requests.get(url).content #先用.content再用etree.HTML(html)方法，不然报错 selector = etree.HTML(html) ''' #... 阅读全文

posted @ 2018-05-25 21:05 了解2号阅读(431) 评论(0) 推荐(0) 编辑

python-requests

摘要：content是bytes数据，包括图片等二进制数据；text是网页代码 content在python 2.7版本中可以顺利打印出网页代码；但是在Python3.6上面打印的中文是乱码，而且很卡，代码持续行状态 text在python 2.7版本中打印出网页代码中文乱码；在Python3.6上面打印阅读全文

posted @ 2018-05-25 15:00 了解2号阅读(245) 评论(0) 推荐(0) 编辑

python正则表达式03--字符串中匹配数字

摘要：\d+使用匹配数字阅读全文

posted @ 2018-05-22 19:32 了解2号阅读(35285) 评论(0) 推荐(0) 编辑

python正则表达式02--findall()和search()方法区别，group()方法

摘要：import re st = 'asxxixxsaefxxlovexxsdwdxxyouxxde' #search()和 findall()的区别 a = re.search('xx(.*?)xxsaefxx(.*?)xxsdwdxx(.*?)xx',st) #print(a) #运行结果 # #group()方法 b = re.search('xx(.*?)xxsaefxx(.*?)x... 阅读全文

posted @ 2018-05-22 19:20 了解2号阅读(2929) 评论(0) 推荐(0) 编辑

python正则表达式01--贪心算法和非贪心算法findall()

摘要：贪心算法，非贪心算法阅读全文

posted @ 2018-05-22 19:06 了解2号阅读(1474) 评论(0) 推荐(0) 编辑