robots.txt

A robots.txt file tells search engine crawlers which pages or files they can or can't request from your site.

https://www.robotstxt.org/

https://support.google.com/webmasters/answer/6062608?hl=en

SEO

https://support.google.com/webmasters/answer/6062608?hl=zh-Hans

https://abc.xgqfrms.xyz/robots.txt

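# robots.txt is a plain-text file stored in the site's root directory.
# It is simple to set up, but powerful.
# It can tell search engine spiders to crawl only specified content, or forbid them from crawling part or all of the site.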
User-agent: Baiduspider
Disallow: /
User-agent: Sosospider
Disallow: /
User-agent: sogou spider
Disallow: /
User-agent: YodaoBot
Disallow: /
User-agent: Googlebot
Disallow: 
User-agent: Bingbot
Disallow: 
User-agent: Slurp
Disallow: 
User-agent: Teoma
Disallow: 
User-agent: ia_archiver
Disallow: 
User-agent: twiceler
Disallow: 
User-agent: MSNBot
Disallow: 
User-agent: Scrubby
Disallow: 
User-agent: Robozilla
Disallow: 
User-agent: Gigabot
Disallow: 
User-agent: googlebot-image
Disallow: 
User-agent: googlebot-mobile
Disallow: 
User-agent: yahoo-mmcrawler
Disallow: 
User-agent: yahoo-blogs/v3.9
Disallow: 
User-agent: psbot
Disallow: 
User-agent: *
Disallow: /bin/
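
To check how rules like these are interpreted, here is a minimal sketch using Python's standard-library `urllib.robotparser`. It parses a shortened copy of the file above (three groups, with the `*` group carrying only the `/bin/` rule). The crawler name `SomeOtherBot` and the sample paths are hypothetical, and real crawlers may resolve rules differently from this parser, so treat the results as an illustration rather than a guarantee.

```python
from urllib.robotparser import RobotFileParser

# Shortened copy of the rules above: one blocked crawler, one fully
# allowed crawler, and the catch-all group.
ROBOTS_TXT = """\
User-agent: Baiduspider
Disallow: /

User-agent: Googlebot
Disallow:

User-agent: *
Disallow: /bin/
"""

# Base URL taken from the robots.txt link above; nothing is fetched.
SITE = "https://abc.xgqfrms.xyz"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Return True if `agent` may request `path` according to the rules."""
    return parser.can_fetch(agent, SITE + path)


parser = build_parser(ROBOTS_TXT)

# "SomeOtherBot" and the paths below are hypothetical examples.
for agent, path in [
    ("Baiduspider", "/"),
    ("Googlebot", "/bin/tool"),
    ("SomeOtherBot", "/bin/tool"),
    ("SomeOtherBot", "/index.html"),
]:
    print(agent, path, is_allowed(parser, agent, path))
```

An empty `Disallow:` value means nothing is disallowed for that group, which is why `Googlebot` can still request `/bin/tool` here while crawlers that fall through to the `*` group cannot.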

posted @ 2020-03-26 22:10 xgqfrms


© xgqfrms 2012-2025
