摘要:
直接上代码: 脱敏后自用的py采集代码, #!/usr/bin/env python # -*- coding:utf-8 -*- """ @author:Andy @file:xxx.py @time:下午05:50 @desc:采集的文章数据进博客 """ import os import re 阅读全文
摘要:
import re import requests r = request.get('http://www.163.com') data = r.text link_list = re.findall(r"(?<=href=\").+?(?=\")", data) for url in link_l 阅读全文