ctrl.blog
robots.txt

Robots Exclusion Standard data for ctrl.blog

Resource Scan

Scan Details

Site Domain ctrl.blog
Base Domain ctrl.blog
Scan Status Ok
Last Scan 2025-06-01T19:03:10+00:00
Next Scan 2025-06-08T19:03:10+00:00
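
The two timestamps above suggest a weekly rescan cadence. A minimal sketch of that arithmetic, using only the values listed here (the variable names are illustrative and not part of the scan tool):

```python
from datetime import datetime

# Timestamps copied from the scan details above.
last_scan = datetime.fromisoformat("2025-06-01T19:03:10+00:00")
next_scan = datetime.fromisoformat("2025-06-08T19:03:10+00:00")

# Interval between the last scan and the next scheduled one.
interval = next_scan - last_scan
print(interval.days)  # 7
```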

Last Scan

Scanned 2025-06-01T19:03:10+00:00
URL https://ctrl.blog/robots.txt
Redirect https://www.ctrl.blog/robots.txt
Redirect Domain www.ctrl.blog
Redirect Base ctrl.blog
Domain IPs 173.230.138.109, 2600:3c02::f03c:92ff:fe16:aefa, 2a01:4f8:c2c:5003::aaaa, 78.46.164.196
Redirect IPs 138.199.46.68, 2400:52e0:1500::868:1
Response IP 138.199.24.219
Found Yes
Hash beae36ebd8cd9f9035519e4586e5c23d54d0a68c104055293221ca7396ab99c3
SimHash 63065385f4e1
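
The Hash value is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The scan data does not name the algorithm, so treat that as an assumption. Under that assumption, a fingerprint of a fetched robots.txt body could be computed roughly as follows. The `fingerprint` helper and the sample bytes are hypothetical, and the result would only match the listed hash if the scanner hashes the raw response body byte for byte.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

# Sample bytes for illustration only; these are not the real file contents.
sample = b"User-agent: *\nDisallow: /api/\n"
digest = fingerprint(sample)
print(len(digest))  # 64, the same length as the Hash value above
```

Comparing the digest of a fresh download against the stored value is one simple way to detect that the file changed between scans.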

Groups

domain re-animator bot
obot
turnitinbot

Rule Path
Disallow /

genai
gptbot
applebot-extended
google-extended

Rule Path
Disallow /entry/

claudebot

Rule Path
Allow /entry/

*

Rule Path
Disallow /api/
Disallow /assets/errorpage/
Disallow /0c491184f9.html
Disallow /google88*
Disallow /pktdd0d6*
Disallow /wmail_*
Disallow /yandex_*
Disallow /BingSite*
Disallow /0c49118*
Disallow /visit
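
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed groups and queries it for a few crawlers. The group order, the exact formatting of the live file, the `allowed` helper, the `ExampleCrawler` agent, and the `/entry/example` and `/api/status` paths are all assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above; the order and exact
# formatting of the live file are assumptions.
ROBOTS_TXT = """\
User-agent: domain re-animator bot
User-agent: obot
User-agent: turnitinbot
Disallow: /

User-agent: genai
User-agent: gptbot
User-agent: applebot-extended
User-agent: google-extended
Disallow: /entry/

User-agent: claudebot
Allow: /entry/

User-agent: *
Disallow: /api/
Disallow: /assets/errorpage/
Disallow: /visit
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.ctrl.blog"  # redirect target reported by the scan

def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` under the rules above?"""
    return parser.can_fetch(agent, BASE + path)

print(allowed("turnitinbot", "/"))                # False: whole site disallowed
print(allowed("GPTBot", "/entry/example"))        # False: /entry/ disallowed
print(allowed("ClaudeBot", "/entry/example"))     # True: /entry/ explicitly allowed
print(allowed("ExampleCrawler", "/api/status"))   # False: falls back to the * group
print(allowed("ExampleCrawler", "/entry/example"))  # True: no matching rule
```

The wildcard rules (for example `/google88*`) and the single-file rule are left out of the sketch because `urllib.robotparser` matches paths by plain prefix and, as far as I know, does not expand `*` inside a path. A crawler that follows RFC 9309 wildcard matching would need a different parser for those lines.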

Other Records

Field Value
sitemap https://www.ctrl.blog/sitemap.xml

Comments

  • Hi robots! Glad you came!
  • Humans, please refer to /humans.txt