huahuayu - 博客园

2018年1月12日

摘要： 1. beautifulsoup 获取标签内容 https://cuiqingcai.com/1319.html 2.正则匹配优先量词与忽略优先量词 https://www.cnblogs.com/nzbbody/p/4391792.html 3. 列表倒序 4.豆瓣API使用 5.python单下阅读全文

posted @ 2018-01-12 23:43 huahuayu 阅读(268) 评论(0) 推荐(0) 编辑

2018年1月9日

Python正则表达式返回首次匹配到的字符及查询的健壮性

摘要： re.findall(pattern,string)会搜索所有匹配的字符，返回的是一个列表，获取首个匹配需要re.findall(pattern,string)[0]访问, 但是如果findall没匹配成功则返回空列表，这时用列表下标去访问元素时就会报IndexError: list index o 阅读全文

posted @ 2018-01-09 21:51 huahuayu 阅读(3335) 评论(0) 推荐(0) 编辑

2018年1月7日

【转载】Python BeautifulSoup匹配字符串

摘要： BeautifulSoup中可以通过name和attrs去定位名称和属性，以找到特定的html代码。更值得称赞的是，attrs支持正则表达式。如： <div class="cool"> <h1 class="abc">design</h1> </div> 搜索此行，可以这样写 abcSoup = 阅读全文

posted @ 2018-01-07 16:36 huahuayu 阅读(4454) 评论(0) 推荐(0) 编辑

Python正则表达式

摘要： 1. 获取字符串中间的一段内容阅读全文

posted @ 2018-01-07 14:15 huahuayu 阅读(269) 评论(0) 推荐(0) 编辑

Python pandas DataFrame操作

摘要： 1. 从字典创建Dataframe 2. 从列表创建Dataframe (先把列表转化为字典，再把字典转化为DataFrame） 3. 从列表创建DataFrame，指定data和columns 4. 修改列名，从['id','name','sex']修改为['Id','Name','Sex'] 5 阅读全文

posted @ 2018-01-07 10:36 huahuayu 阅读(35108) 评论(0) 推荐(0) 编辑

2018年1月6日

Python学友

摘要：独学而无友，则孤陋而寡闻，python学习过程中希望多和学友交流，一起进步。开源中国 j_hao104 微信公众号: Pythoner每日一报 https://my.oschina.net/jhao104/home 也在学习python的cnblog网友 aubucuo https://www.c 阅读全文

posted @ 2018-01-06 00:22 huahuayu 阅读(205) 评论(0) 推荐(0) 编辑

2018年1月5日

Python学习资源

摘要： Python学习过程中觉得不错的学习资源记录于此，长期更新：用Python玩转数据 Data Processing Using Python - Coursera https://www.coursera.org/learn/hipython/home/welcome Python 爬虫学习系列教阅读全文

posted @ 2018-01-05 22:47 huahuayu 阅读(286) 评论(0) 推荐(0) 编辑

Python爬虫通过替换http request header来欺骗浏览器实现登录

摘要：以豆瓣为例，访问https://www.douban.com/contacts/list 来查看自己关注的人，要登录才能查看。如果用requests.get()方法获取这个http，没登录只能抓取回一个登录界面，所以我们要用Python登录网站才能抓取想要的网页。一个简便的方法就是自己在浏览器上阅读全文

posted @ 2018-01-05 22:07 huahuayu 阅读(3009) 评论(3) 推荐(0) 编辑

2018年1月2日

Python sort方法

摘要：官方文档： sort(*, key=None, reverse=False) This method sorts the list in place, using only < comparisons between items. Exceptions are not suppressed - if 阅读全文

posted @ 2018-01-02 23:44 huahuayu 阅读(2577) 评论(0) 推荐(1) 编辑

Python删除list中多个相同元素

摘要： pop和remove方法都可以删除list中的元素，个人更倾向于使用pop方法。 pop方法：删除过程中还能返回被删除的值 remove方法：从左往右，删除首次出现的指定元素删除过程不会返回被删除的值阅读全文

posted @ 2018-01-02 23:10 huahuayu 阅读(23480) 评论(0) 推荐(2) 编辑

huahuayu's notes

公告