scrapy shell 使用案例

使用scrapy shell http://bj.maitian.cn/esfall访问麦田房产北京的二手房，得到response:第一页的html
目标：获取标题、价格、面积、区的信息
标题：response.xpath('//div[@class="list_title"]/h1/a/text()').extract_first()
extract_first()：只提取第一个值
//div:找到所有的div标签
[@class=""]:按标签属性的值查找
价格：response.xpath('//div[@class="list_title"]/div[@class="the_price"]/ol/strong/span/text()').extract()
面积：response.xpath('//div[@class="list_title"]/p/span[1]/text()').extract_first()
区：response.xpath('//div[@class="list_title"]/p/text()').extract_first().re(r'昌平|朝阳|东城|大兴|房山|丰台|海淀|门头沟|平谷|石景山|顺义|通州|西城')

posted @ 2020-04-22 12:42 wind_y 阅读(172) 评论(0) 编辑收藏举报

刷新页面返回顶部

三夕