上一页 1 ··· 27 28 29 30 31 32 33 34 35 ··· 38 下一页
摘要: item.py pipeline.py spd.py 阅读全文
posted @ 2017-05-11 15:13 Erick-LONG 阅读(223) 评论(0) 推荐(0) 编辑
摘要: pipeline item 阅读全文
posted @ 2017-05-10 17:29 Erick-LONG 阅读(1726) 评论(0) 推荐(0) 编辑
摘要: rules = [ Rule(SgmlLinkExtractor(allow=('/u012150179/article/details'), restrict_xpaths=('//li[@class="next_article"]')), callback='parse_ite... 阅读全文
posted @ 2017-05-10 16:05 Erick-LONG 阅读(778) 评论(0) 推荐(0) 编辑
摘要: UA池 阅读全文
posted @ 2017-05-10 15:05 Erick-LONG 阅读(517) 评论(0) 推荐(0) 编辑
摘要: 放入项目目录,配置setting.py 阅读全文
posted @ 2017-05-10 14:19 Erick-LONG 阅读(649) 评论(0) 推荐(0) 编辑
摘要: class CsvspiderSpider(CSVFeedSpider): name = 'csvspider' allowed_domains = ['iqianyue.com'] start_urls = ['http://iqianyue.com/feed.csv'] headers = ['id', 'name', 'description', 'imag... 阅读全文
posted @ 2017-05-10 13:51 Erick-LONG 阅读(314) 评论(0) 推荐(0) 编辑
摘要: from scrapy.spiders import XMLFeedSpider from myxml.items import MyxmlItem class XmlspiderSpider(XMLFeedSpider): name = 'xmlspider' allowed_domains = ['sina.com.cn'] start_urls = ['http:... 阅读全文
posted @ 2017-05-10 13:35 Erick-LONG 阅读(209) 评论(0) 推荐(0) 编辑
摘要: import scrapy from Autopjt.items import myItem from scrapy.http import Request class AutospdSpider(scrapy.Spider): name = "fulong_spider" start_urls = 阅读全文
posted @ 2017-05-10 13:15 Erick-LONG 阅读(1678) 评论(0) 推荐(0) 编辑
摘要: pipeline部分 item部分 阅读全文
posted @ 2017-05-10 13:01 Erick-LONG 阅读(545) 评论(0) 推荐(0) 编辑
摘要: 1 #!/usr/bin/env python 2 # -*- coding:utf-8 -*- 3 4 ###########接口###################### 5 # class Iorderddd(): 6 # def fetch_one_by(self,nid): 7 # raise Exception('子类必须实现该... 阅读全文
posted @ 2017-05-07 17:31 Erick-LONG 阅读(168) 评论(0) 推荐(0) 编辑
上一页 1 ··· 27 28 29 30 31 32 33 34 35 ··· 38 下一页