comedybeastradio.com
robots.txt

Robots Exclusion Standard data for comedybeastradio.com

Resource Scan

Scan Details

Site Domain comedybeastradio.com
Base Domain comedybeastradio.com
Scan Status Ok
Last Scan2024-05-20T11:41:34+00:00
Next Scan 2024-06-19T11:41:34+00:00

Last Scan

Scanned2024-05-20T11:41:34+00:00
URL https://comedybeastradio.com/robots.txt
Redirect https://www.comedybeastradio.com/robots.txt
Redirect Domain www.comedybeastradio.com
Redirect Base comedybeastradio.com
Domain IPs 199.34.228.72
Redirect IPs 199.34.228.72
Response IP 199.34.228.72
Found Yes
Hash 51a1cd692c296ff7702e4e0801428dd982ca22472f1906303739148f62abef91
SimHash 6a54d8746693

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /https%3A//shop.spreadshirt.com/comedy-beast-radio/
Disallow /https%3A//patreon.com/comedybeastradio

Other Records

Field Value
sitemap https://www.comedybeastradio.com/sitemap.xml