网络爬虫作业

整体思路还行,但是细节不清楚,有很多问题,json库不太会,只能写这么多,那个result+=......是抄袭的,可以理解一点,有几个不懂

import requests
import json
url='https://edu.cnblogs.com/Homework/GetAnswers?homeworkId=2420&_=1542959851766'
def get_html(url):
r=requests.get(url)
r.raise_for_status()
r.encoding='utf-8'
return r.text
datas=json.loads(r.text)['data']
result=""
for data in datas:
result+=str(data['StudentNo'])+','+data['RealName']+','+data['DateAdded'].replace('T',' ')+','+data['Title']+','+data['Url']+'\n'
f=open(hwlist.csv,'w')
f.wright(result)

  

  

posted on 2018-12-07 22:47  大江东回去  阅读(103)  评论(0编辑  收藏  举报

导航