texastribune.org
robots.txt

Robots Exclusion Standard data for texastribune.org

Resource Scan

Scan Details

Site Domain texastribune.org
Base Domain texastribune.org
Scan Status Ok
Last Scan2024-04-30T23:26:10+00:00
Next Scan 2024-05-07T23:26:10+00:00

Last Scan

Scanned2024-04-30T23:26:10+00:00
URL https://texastribune.org/robots.txt
Redirect https://www.texastribune.org/robots.txt
Redirect Domain www.texastribune.org
Redirect Base texastribune.org
Domain IPs 104.22.38.184, 104.22.39.184, 172.67.24.106
Redirect IPs 104.22.38.184, 104.22.39.184, 172.67.24.106
Response IP 172.67.24.106
Found Yes
Hash f1c594ccab9574470509043bd2c8ad3adb0f5f1cef11d8d1e675de60faa4f4b6
SimHash e9a15c082c92

Groups

*

Rule Path
Disallow /admin/
Disallow /search/
Disallow /test/
Disallow /content/republish/
Disallow /accounts/login/?*
Disallow /*/search/
Disallow /feeds/main/*
Disallow /multimedia/images/
Disallow /library/data/campaign-finance/*/*
Disallow /library/data/government-employee-salaries/
Disallow /2017/05/23/private-post-session-livestream/
Disallow /theblast/20*

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.texastribune.org/sitemap.xml
sitemap https://www.texastribune.org/sitemap_news.xml
sitemap https://salaries.texastribune.org/sitemap.xml
sitemap https://schools.texastribune.org/sitemap.xml