如何刷博客园阅读量
点击我前往Github查看源代码 别忘记star
本项目github地址:https://github.com/wangqifan/ZhiHu
博客园是如何记录阅读量的?
阅读一篇文章,如果刷新页面,阅读量并不会增加。博客园是如何来计算阅读量的嗯。先看看http请求
阅读量就是靠这个请求来记录的
get http://counter.cnblogs.com/blog/post/6323533
Host: counter.cnblogs.com User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:49.0) Gecko/20100101 Firefox/49.0 Accept: */* Accept-Language: zh-CN,zh;q=0.8,en-US;q=0.5,en;q=0.3 Accept-Encoding: gzip, deflate Referer: http://www.cnblogs.com/zuin/p/6323533.html Cookie: .CNBlogsCookie=956FC9404C44EBFB70E9E67DA4082C0CDE75C675E827CF7506F080AAB6F30566F0B2FE2344A48ABD6C36955C4F109F4223DE59BDB7ED16B289F4687BF794B9D583E7A20D2F7BB56D1F6E7AB2CF6594F621C107BC Connection: keep-alive If-Modified-Since: Fri, 17 Feb 2017 04:02:27 GMT Cache-Control: max-age=0
现在来写程序模拟这个行为 (NOTE:需要伪装成不同的IP和不同的浏览器)
建立UserAgent池
private static string[] UserAgentPool = new string[] { "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/51.0.2704.79 Safari/537.36 Edge/14.14393",//Edge "Mozilla/5.0 (Windows NT 6.3; Trident/7.0; rv:11.0) like Gecko",//IE11 "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36",//Chrome "Mozilla/5.0 (Windows NT 10.0; WOW64; rv:50.0) Gecko/20100101 Firefox/50.0",// Firefox "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36 OPR/42.0.2393.94",//Opera "Mozilla/5.0 (Linux; Android 5.1.1; Nexus 6 Build/LYZ28E) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/48.0.2564.23 Mobile Safari/537.36", "Mozilla/5.0 (iPhone; CPU iPhone OS 9_1 like Mac OS X) AppleWebKit/601.1.46 (KHTML, like Gecko) Version/9.0 Mobile/13B143 Safari/601.1", "Mozilla/5.0 (Linux; U; Android 4.0.3; zh-cn; M032 Build/IML74K) AppleWebKit/533.1 (KHTML, like Gecko)Version/4.0 MQQBrowser/4.1 Mobile Safari/533.1", "MQQBrowser/38 (iOS 4; U; CPU like Mac OS X; zh-cn)", "Mozilla/5.0 (Windows Phone 10.0; Android 4.2.1; Microsoft; Lumia 950) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/46.0.2486.0 Mobile Safari/537.36 Edge/14.14263" };
建立代理池
项目地址:https://github.com/wangqifan/ProxyPool 自己写的 定时抓取网上免费代理 求star
代码放到github上面
https://github.com/wangqifan/cnblogs
其他项目推荐:
知乎爬虫 :https://github.com/wangqifan/ZhiHu博客推荐:
反反爬虫策略:http://www.cnblogs.com/zuin/p/6323533.html