es.thechurchnews.com
robots.txt

Robots Exclusion Standard data for es.thechurchnews.com

Resource Scan

Scan Details

Site Domain es.thechurchnews.com
Base Domain thechurchnews.com
Scan Status Ok
Last Scan2024-11-03T11:43:53+00:00
Next Scan 2024-11-17T11:43:53+00:00

Last Scan

Scanned2024-11-03T11:43:53+00:00
URL https://es.thechurchnews.com/robots.txt
Domain IPs 23.49.60.48, 23.49.60.59, 2600:1413:b000:13::b857:c192, 2600:1413:b000:13::b857:c196
Response IP 23.45.207.166
Found Yes
Hash 3bec8634751384faa06190595ca73b70e11d3dad3916a894be4b7c57c59b75f1
SimHash 5405c8c0e113

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://es.thechurchnews.com/arc/outboundfeeds/sitemap-index/
sitemap https://es.thechurchnews.com/arc/outboundfeeds/sitemap-news-index/
sitemap https://es.thechurchnews.com/arc/outboundfeeds/sitemap-section-index/
sitemap https://es.thechurchnews.com/arc/outboundfeeds/sitemap-index-year/
sitemap https://media.thechurchnews.com/sitemaps/churchnews-es/sitemap-index.xml