试着给VuePress添加全局禁止爬取支持，基于vuepress-plugin-robots

背景

有时候，我们有些内部网站希望不被外部抓取，那么我们可以借助vuepress-plugin-robots来生成robots.txt文件，来告诉爬虫不要抓取页面。

安装

npm install vuepress-plugin-robots

项目地址：https://github.com/HiYue/vuepress-plugin-robots

配置

准备一个sitemap.xml文件，位置随意，路径和下文对应上就行。

<xml version="1.0" encoding="UTF-8" />

在.vuepress/config.js中追加项plugins-robots

plugins: {
        'robots': {
            host: "http://www.example.com",
            disallowAll: true,
            sitemap: "/assets/xml/sitemap.xml",
        },
    }

其中，

host是必填项，填写网站域名，
disallowAll是true，代表禁止所有爬虫，如果要放开，需要设置成false
sitemap是必填项

效果

编译完成后

我们将得到一个robots.txt文件，路径是：http://www.example.com/robots.txt

同时得到一个sitemap.xml文件，路径是：http:///www.example.com/assets/xml/sitemap.xml

posted @ 2020-10-09 15:09 TaylorShi 阅读(1184) 评论(0) 编辑收藏举报

努力加载评论中...

刷新页面返回顶部

公告

昵称： TaylorShi
园龄： 13年
粉丝： 99
关注： 6

+加关注

2025年2月

日

一

二

三

四

五

六

随笔分类

随笔档案

文章档案

2013年6月(1)

TaylorShi

试着给VuePress添加全局禁止爬取支持，基于vuepress-plugin-robots

背景

安装

配置

效果

公告

搜索

常用链接

我的标签

随笔分类

随笔档案

文章档案

阅读排行榜

评论排行榜

推荐排行榜

最新评论