guidelive.com
robots.txt

Robots Exclusion Standard data for guidelive.com

Resource Scan

Scan Details

Site Domain guidelive.com
Base Domain guidelive.com
Scan Status Ok
Last Scan2024-06-14T18:34:13+00:00
Next Scan 2024-06-21T18:34:13+00:00

Last Scan

Scanned2024-06-14T18:34:13+00:00
URL https://guidelive.com/robots.txt
Redirect https://www.dallasnews.com/robots.txt
Redirect Domain www.dallasnews.com
Redirect Base dallasnews.com
Domain IPs 15.197.248.213, 3.33.236.11
Redirect IPs 23.44.4.208, 23.44.4.210, 2600:1413:b000:13::b857:c18e, 2600:1413:b000:13::b857:c19b
Response IP 42.99.140.186
Found Yes
Hash dc7c2babe1fa3877228127944daa7bfe10e4643559b9a4e5e6631879a0dfbcd0
SimHash 690cd940e193

Groups

*

Rule Path
Allow /

magnetbot

Rule Path
Allow /

*

Rule Path
Disallow /help/thank*
Disallow /checkout/*
Disallow /offers*
Disallow /subscribe*

magnetbot

Rule Path
Disallow /checkout/*
Disallow /offers*
Disallow /subscribe*

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dallasnews.com/arc/outboundfeeds/sitemap-index/?outputType=xml