网络爬虫作业
整体思路还行,但是细节不清楚,有很多问题,json库不太会,只能写这么多,那个result+=......是抄袭的,可以理解一点,有几个不懂
import requests import json url='https://edu.cnblogs.com/Homework/GetAnswers?homeworkId=2420&_=1542959851766' def get_html(url): r=requests.get(url) r.raise_for_status() r.encoding='utf-8' return r.text datas=json.loads(r.text)['data'] result="" for data in datas: result+=str(data['StudentNo'])+','+data['RealName']+','+data['DateAdded'].replace('T',' ')+','+data['Title']+','+data['Url']+'\n' f=open(hwlist.csv,'w') f.wright(result)