caracolnoticias.com
robots.txt

Robots Exclusion Standard data for caracolnoticias.com

Resource Scan

Scan Details

Site Domain caracolnoticias.com
Base Domain caracolnoticias.com
Scan Status Ok
Last Scan2024-06-24T03:07:44+00:00
Next Scan 2024-07-01T03:07:44+00:00

Last Scan

Scanned2024-06-24T03:07:44+00:00
URL https://caracolnoticias.com/robots.txt
Redirect https://www.noticiascaracol.com/robots.txt
Redirect Domain www.noticiascaracol.com
Redirect Base noticiascaracol.com
Domain IPs 52.10.116.134, 52.24.33.209
Redirect IPs 3.165.102.119, 3.165.102.57, 3.165.102.68, 3.165.102.86
Response IP 3.165.102.57
Found Yes
Hash 7de314d97b9f8cc9ac9f903743804b713cbf1c0d56d400209e01e7b0229143d1
SimHash 41213371ab83

Groups

*

Rule Path
Allow /
Disallow /_track
Disallow /pushnotifications/*
Disallow /instant-articles-ads

grapeshot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.noticiascaracol.com/sitemap.xml
sitemap https://www.noticiascaracol.com/section-sitemap.xml
sitemap https://www.noticiascaracol.com/tag-sitemap.xml
sitemap https://www.noticiascaracol.com/content-sitemap.xml
sitemap https://www.noticiascaracol.com/image-sitemap.xml
sitemap https://www.noticiascaracol.com/media-sitemap.xml
sitemap https://www.noticiascaracol.com/author-sitemap-content.xml
sitemap https://www.noticiascaracol.com/index-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/section-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/tag-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/content-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/image-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/media-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/author-sitemap-content.xml
sitemap https://www.noticiascaracol.com/deportes/index-sitemap.xml
sitemap https://www.noticiascaracol.com/deportes/content-sitemap-latest.xml
sitemap https://www.noticiascaracol.com/golcaracol/sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/section-sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/tag-sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/content-sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/image-sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/media-sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/author-sitemap-content.xml
sitemap https://www.noticiascaracol.com/golcaracol/index-sitemap.xml
sitemap https://www.noticiascaracol.com/golcaracol/content-sitemap-latest.xml