startrib.com
robots.txt

Robots Exclusion Standard data for startrib.com

Resource Scan

Scan Details

Site Domain startrib.com
Base Domain startrib.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-12-18T22:38:45+00:00
Next Scan 2026-01-01T22:38:45+00:00

Last Successful Scan

Scanned 2025-11-08T06:04:42+00:00
URL http://startrib.com/robots.txt
Redirect https://www.startribune.com/robots.txt
Redirect Domain www.startribune.com
Redirect Base startribune.com
Domain IPs 192.64.119.157
Redirect IPs 76.76.21.142, 76.76.21.61
Response IP 76.76.21.123
Found Yes
Hash c7b58e7955d958536fa1287fa623926aef645ab0029d5d9c2c9fbed2932e1e51
SimHash 68049a228bb3

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /login

*

Rule Path
Disallow /obituaries/search
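
To illustrate how the three `*` groups above might be evaluated together, here is a minimal sketch in Python. It assumes the groups are merged into one rule set and resolved by longest matching path, with Allow winning ties, as RFC 9309 describes. The names `RULES` and `is_allowed` are hypothetical, and the sketch handles plain path prefixes only (no `*` or `$` wildcards, which none of the listed rules use).

```python
# Rules copied from the three "*" groups in this report.
RULES = [
    ("allow", "/"),
    ("disallow", "/login"),
    ("disallow", "/obituaries/search"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching prefix wins, and Allow wins a tie.
    A path that matches no rule is allowed by default.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if path.startswith(prefix):
            n = len(prefix)
            if n > best_len or (n == best_len and kind == "allow"):
                best_len = n
                allowed = (kind == "allow")
    return allowed


if __name__ == "__main__":
    for p in ["/", "/login", "/obituaries/", "/obituaries/search?q=smith"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/login` and anything beneath `/obituaries/search` are blocked, while the rest of the site stays crawlable through `Allow /`. Parsers that only honor the first `*` group would instead treat every path as allowed, so results can differ between crawlers.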

Other Records

Field Value
sitemap https://www.startribune.com/sitemap-fresh-news-index.xml/
sitemap https://www.startribune.com/sitemap-fresh-video-index.xml/
sitemap https://www.startribune.com/sitemap-full-index.xml/
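
The sitemap records above could be pulled out of a robots.txt body with a few lines of Python. This is a sketch only: `ROBOTS_TXT` is a reconstruction from the groups and records listed in this report, not the file as actually served, and `extract_sitemaps` is a hypothetical helper name.

```python
# Assumed reconstruction of the file from this report's Groups and
# Other Records sections; the real file's layout may differ.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: *
Disallow: /login

User-agent: *
Disallow: /obituaries/search

Sitemap: https://www.startribune.com/sitemap-fresh-news-index.xml/
Sitemap: https://www.startribune.com/sitemap-fresh-video-index.xml/
Sitemap: https://www.startribune.com/sitemap-full-index.xml/
"""


def extract_sitemaps(text):
    """Collect the values of all `Sitemap:` lines (field name is case-insensitive)."""
    sitemaps = []
    for line in text.splitlines():
        # Drop trailing comments, then split on the first colon only,
        # so the colon inside "https://" stays in the value.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps


if __name__ == "__main__":
    for url in extract_sitemaps(ROBOTS_TXT):
        print(url)
```

Sitemap records are independent of user-agent groups, so the helper scans the whole file rather than a single group.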