scrapy中的xpath用法和css的用法

css

response.css(".list-left dd:not(.page)")

img.css("a::text").extract_first()

img.css("a::attr(href)").extract_first()

response.css(".page-en:nth-last-child(2)::attr(href)").extract_first()

result = html.xpath('//li/a[@href="link1.html"]')

result = html.xpath('//li[last()]/a/@href')

result = html.xpath('//li[last()-1]/a')

#result = html.xpath('//li/span')
#注意这么写是不对的：
#因为 / 是用来获取子元素的，而 <span> 并不是 <li> 的子元素，所以，要用双斜杠

result = html.xpath('//li//span')

posted @ 2018-10-22 13:13 发疯的man 阅读(1430) 评论(0) 收藏举报

刷新页面返回顶部