ldschurchnews.com
robots.txt

Robots Exclusion Standard data for ldschurchnews.com

Resource Scan

Scan Details

Site Domain ldschurchnews.com
Base Domain ldschurchnews.com
Scan Status Ok
Last Scan2024-08-30T22:14:34+00:00
Next Scan 2024-09-29T22:14:34+00:00

Last Scan

Scanned2024-08-30T22:14:34+00:00
URL https://www.ldschurchnews.com/robots.txt
Redirect https://www.thechurchnews.com/robots.txt
Redirect Domain www.thechurchnews.com
Redirect Base thechurchnews.com
Domain IPs 172.66.41.22, 172.66.42.234, 2606:4700:3108::ac42:2916, 2606:4700:3108::ac42:2aea
Redirect IPs 23.52.171.113, 23.52.171.88, 2600:1413:b000:13::b857:c188, 2600:1413:b000:13::b857:c190
Response IP 23.45.207.169
Found Yes
Hash 5ecb910f49cf2ec7bd34d1dd931a68d88f416ae53e43379cbef893b8e24368e0
SimHash 640091684511

Groups

gptbot

Rule Path
Allow /almanac/
Disallow /

google-extended

Rule Path
Allow /almanac/
Disallow /

anthropic-ai

Rule Path
Allow /almanac/
Disallow /

cohere-ai

Rule Path
Allow /almanac/
Disallow /

omgili

Rule Path
Allow /almanac/
Disallow /

omgilibot

Rule Path
Allow /almanac/
Disallow /

piplbot

Rule Path
Allow /almanac/
Disallow /

bytespider

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.thechurchnews.com/arc/outboundfeeds/sitemap-index/
sitemap https://www.thechurchnews.com/arc/outboundfeeds/sitemap-news-index/
sitemap https://www.thechurchnews.com/arc/outboundfeeds/sitemap-section-index/
sitemap https://www.thechurchnews.com/arc/outboundfeeds/sitemap-index-year/
sitemap https://media.thechurchnews.com/sitemaps/churchnews/sitemap-index.xml