radio.co
robots.txt

Robots Exclusion Standard data for radio.co

Resource Scan

Scan Details

Site Domain radio.co
Base Domain radio.co
Scan Status Ok
Last Scan2024-09-21T15:02:54+00:00
Next Scan 2024-09-28T15:02:54+00:00

Last Scan

Scanned2024-09-21T15:02:54+00:00
URL https://radio.co/robots.txt
Domain IPs 104.22.46.146, 104.22.47.146, 172.67.23.56, 2606:4700:10::6816:2e92, 2606:4700:10::6816:2f92, 2606:4700:10::ac43:1738
Response IP 104.22.47.146
Found Yes
Hash ff2a2cdefbd73a6b2c34172353572a74d8ce146f98ddc54c29ad03621d98a18f
SimHash 21081d16ef15

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /cdn-cgi/
Disallow /admin/
Disallow *studio.radio.co/checkout*
Disallow *status.radio.co/*
Allow *status.radio.co
Disallow *sites.radio.co*

Other Records

Field Value
sitemap https://radio.co/sitemaps-1-sitemap.xml
sitemap https://radio.co/sitemap.xml
sitemap https://radio.co/video-sitemap.xml

Comments

  • robots.txt for https://radio.co/
  • live - don't allow web crawlers to index cpresources/ or vendor/