blog.devhitao.com
robots.txt

Robots Exclusion Standard data for blog.devhitao.com

Resource Scan

Scan Details

Site Domain blog.devhitao.com
Base Domain devhitao.com
Scan Status Ok
Last Scan 2025-09-24T14:50:29+00:00
Next Scan 2025-10-08T14:50:29+00:00

Last Scan

Scanned 2025-09-24T14:50:29+00:00
URL https://blog.devhitao.com/robots.txt
Domain IPs 104.21.16.138, 172.67.212.236, 2606:4700:3031::ac43:d4ec, 2606:4700:3035::6815:108a
Response IP 172.67.212.236
Found Yes
Hash 7fb5b03041caf95245896f2ae099d4e2f28bb5fead8c5e363df725a430aa4504
SimHash a0a2b30eeed2

Groups

*

Rule Path
Disallow /page*
Disallow /page/
Disallow /archives*
Disallow /archives/
Disallow /tags*
Disallow /tags/
Disallow /categories*
Disallow /categories/
Disallow /privacy*
Disallow /link/
Disallow /site/
Disallow /baidu_verify_IH05gTwXFo.html
Disallow /googlecb9885b940fa1d4b.html
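
To see which paths the rules above would block, here is a minimal sketch of a matcher. It assumes the common wildcard semantics described in RFC 9309 (a rule is a prefix match from the start of the path, `*` matches any run of characters, and a trailing `$` anchors the end). The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and not part of the scan output, and real crawlers may interpret the rules differently.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/page*", "/page/", "/archives*", "/archives/",
    "/tags*", "/tags/", "/categories*", "/categories/",
    "/privacy*", "/link/", "/site/",
    "/baidu_verify_IH05gTwXFo.html",
    "/googlecb9885b940fa1d4b.html",
]

def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex (assumed semantics)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and let each "*" match any sequence of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path, patterns=DISALLOW):
    """Return True if any Disallow pattern matches the path from its start."""
    # re.match anchors at the beginning, which gives the prefix behaviour.
    return any(pattern_to_regex(p).match(path) for p in patterns)

if __name__ == "__main__":
    for path in ["/page/2/", "/tags/python/", "/link", "/2025/09/post.html"]:
        print(path, is_disallowed(path))
```

Under these assumed semantics the trailing `*` is redundant, so `/page*` already covers everything `/page/` does, and it also matches paths such as `/pages`. By contrast, `/link/` and `/site/` only match paths that include the trailing slash.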

Comments

  • This file is stored in the site's root directory, e.g. http://blog.devhitao.com/robots.txt
  • Blocks crawling of the page, archives, tags, and categories files and directories