sdhssjzz.com
robots.txt

Robots Exclusion Standard data for sdhssjzz.com

Resource Scan

Scan Details

Site Domain sdhssjzz.com
Base Domain sdhssjzz.com
Scan Status Ok
Last Scan2025-06-22T16:54:53+00:00
Next Scan 2025-07-22T16:54:53+00:00

Last Scan

Scanned2025-06-22T16:54:53+00:00
URL https://sdhssjzz.com/robots.txt
Domain IPs 104.21.90.35, 172.67.194.38, 2606:4700:3031::6815:5a23, 2606:4700:3033::ac43:c226
Response IP 104.21.90.35
Found Yes
Hash b570e2c75609daa995c1e91e796a8ff4b24724d66be81c85b467132edc5ca2dd
SimHash ec3551f2971a

Groups

*

Rule Path
Disallow /

baiduspider

Rule Path
Disallow

googlebot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Comments

  • 默认屏蔽所有爬虫访问网站å†
  • å
  • 屏蔽谷歌爬虫
  • 屏蔽Bing爬虫
  • 屏蔽搜狗爬虫
  • 屏蔽360搜索爬虫
  • 屏蔽神马搜索
  • 屏蔽Ahrefs爬虫(站点分析工å
  • 屏蔽Semrush爬虫
  • 屏蔽MJ12bot(SEOå·¥å
  • 屏蔽Yandex(俄罗斯搜索引擎)

Warnings

  • 7 invalid lines.