deseretnews.com
robots.txt

Robots Exclusion Standard data for deseretnews.com

Resource Scan

Scan Details

Site Domain deseretnews.com
Base Domain deseretnews.com
Scan Status Ok
Last Scan 2024-05-17T15:14:00+00:00
Next Scan 2024-05-24T15:14:00+00:00

Last Scan

Scanned 2024-05-17T15:14:00+00:00
URL https://deseretnews.com/robots.txt
Redirect https://www.deseret.com/robots.txt
Redirect Domain www.deseret.com
Redirect Base deseret.com
Domain IPs 104.22.4.19, 104.22.5.19, 172.67.15.27, 2606:4700:10::6816:413, 2606:4700:10::6816:513, 2606:4700:10::ac43:f1b
Redirect IPs 125.56.219.10, 2600:1413:b000:14::b857:c148, 2600:1413:b000:14::b857:c14b, 96.17.72.81
Response IP 23.202.33.193
Found Yes
Hash 567f8b5dc4b962c3e3e03ef96729b150ec0edff7f68b80afe080e938e30a1568
SimHash 4004d94086d7

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

*

Rule Path
Disallow
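
The groups above can be checked with a short Python sketch using the standard library's urllib.robotparser. The robots.txt text below is rebuilt from the scan's group table, so its exact wording, ordering, and capitalization are assumptions rather than the file as served; the helper names build_robots_txt and allowed, and the agent name "ExampleBot", are hypothetical. An empty Disallow value, as in the * group, is conventionally read as "nothing is disallowed".

```python
from urllib.robotparser import RobotFileParser

# User agents listed with "Disallow /" in the scan's group table.
BLOCKED_AGENTS = [
    "gptbot", "google-extended", "anthropic-ai", "cohere-ai",
    "omgili", "omgilibot", "piplbot", "bytespider",
]


def build_robots_txt(blocked):
    """Rebuild an approximate robots.txt as a list of lines.

    Assumption: this mirrors the scan's groups, not the served file's
    exact text.
    """
    lines = []
    for agent in blocked:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # Catch-all group with an empty Disallow: everything is allowed.
    lines += ["User-agent: *", "Disallow:"]
    return lines


parser = RobotFileParser()
parser.parse(build_robots_txt(BLOCKED_AGENTS))


def allowed(agent, url="https://www.deseret.com/"):
    """Return True if the rebuilt rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ["GPTBot", "anthropic-ai", "ExampleBot"]:
        print(agent, allowed(agent))
```

Under these rules the listed crawlers are refused every path, while any agent that matches none of the named groups falls through to the * group and is allowed. Note that urllib.robotparser matches agent names case-insensitively and by substring, which may differ from how a given crawler interprets the file.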

Other Records

Field Value
sitemap https://www.deseret.com/arc/outboundfeeds/sitemap-index/
sitemap https://www.deseret.com/arc/outboundfeeds/sitemap-news-index/
sitemap https://www.deseret.com/arc/outboundfeeds/sitemap-section-index/
sitemap https://www.deseret.com/arc/outboundfeeds/sitemap-index-year/
sitemap https://uploads.deseret.com/sitemaps/deseretnews/sitemap-index.xml
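
As a rough illustration of how a client might read these sitemap records, the sketch below feeds them to urllib.robotparser as Sitemap lines and lists the distinct hosts they point to. The "Sitemap:" line form is an assumption about how the records appear in the file, and RobotFileParser.site_maps() requires Python 3.8 or later.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Sitemap records from the scan, written as robots.txt lines
# (assumed form; the scan lists only field and value).
SITEMAP_LINES = [
    "Sitemap: https://www.deseret.com/arc/outboundfeeds/sitemap-index/",
    "Sitemap: https://www.deseret.com/arc/outboundfeeds/sitemap-news-index/",
    "Sitemap: https://www.deseret.com/arc/outboundfeeds/sitemap-section-index/",
    "Sitemap: https://www.deseret.com/arc/outboundfeeds/sitemap-index-year/",
    "Sitemap: https://uploads.deseret.com/sitemaps/deseretnews/sitemap-index.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no Sitemap lines were found.
sitemaps = parser.site_maps() or []

# Distinct hosts serving the sitemaps.
hosts = sorted({urlsplit(url).netloc for url in sitemaps})

if __name__ == "__main__":
    print(len(sitemaps), hosts)
```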