【python】获取指定网页上的所有超级链接

Posted on 2016-09-08 15:59 毕加索的ma 阅读(1959) 评论(0) 编辑收藏举报

# -*- coding: utf-8 -*-
import urllib2
import re

#connect to a URL
website = urllib2.urlopen("http://www.baidu.com")
#read html code
html = website.read()
#use re.findall to get all the links
links = re.findall('"((http|ftp)s?://.*?)"', html)  ###".*?"任意匹配
print links

会员力量，点亮园子希望

刷新页面返回顶部

菜比之路

公告

【python】获取指定网页上的所有超级链接