cherryblossomwatch.com
robots.txt

Robots Exclusion Standard data for cherryblossomwatch.com

Resource Scan

Scan Details

Site Domain cherryblossomwatch.com
Base Domain cherryblossomwatch.com
Scan Status Ok
Last Scan2024-11-15T06:44:24+00:00
Next Scan 2024-11-22T06:44:24+00:00

Last Scan

Scanned2024-11-15T06:44:24+00:00
URL https://cherryblossomwatch.com/robots.txt
Domain IPs 104.25.131.70, 104.25.132.70, 172.67.83.209, 2606:4700:20::6819:8346, 2606:4700:20::6819:8446, 2606:4700:20::ac43:53d1
Response IP 172.67.83.209
Found Yes
Hash 3339632cfb9a9a8ab7cd0d69786b9877726ac61166b1fd9d2b17014dab3c39ec
SimHash 18189d02e8ba

Groups

*

Rule Path
Disallow */?s=
Disallow /search/
Disallow */?amp=1&s=
Disallow /wp-login.php
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /feed/
Disallow /*/feed/

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ahrefsbot
mj12bot
semrushbot
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
splitsignalbot
semrushbot-coub
rogerbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cherryblossomwatch.com/sitemaps.xml

Comments

  • All Bots
  • Block AI crawlers
  • Block unwanted SEO bots
  • Sitemap