caracoltv.com
robots.txt

Robots Exclusion Standard data for caracoltv.com

Resource Scan

Scan Details

Site Domain caracoltv.com
Base Domain caracoltv.com
Scan Status Ok
Last Scan2024-10-29T23:27:08+00:00
Next Scan 2024-11-05T23:27:08+00:00

Last Scan

Scanned2024-10-29T23:27:08+00:00
URL https://caracoltv.com/robots.txt
Redirect https://www.caracoltv.com/robots.txt
Redirect Domain www.caracoltv.com
Redirect Base caracoltv.com
Domain IPs 35.165.175.243, 52.10.18.78
Redirect IPs 3.165.102.119, 3.165.102.57, 3.165.102.68, 3.165.102.86
Response IP 13.35.238.101
Found Yes
Hash 083ecc9aacc45a5909a6b9bd63c61824b5190c02b2f2f899868125d149cb88d1
SimHash 0f145b61e333

Groups

*

Rule Path
Disallow /_track
Disallow /pushnotifications/*
Disallow /instant-articles-ads

grapeshot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.caracoltv.com/sitemap.xml
sitemap https://www.caracoltv.com/section-sitemap.xml
sitemap https://www.caracoltv.com/tag-sitemap.xml
sitemap https://www.caracoltv.com/content-sitemap.xml
sitemap https://www.caracoltv.com/image-sitemap.xml
sitemap https://www.caracoltv.com/media-sitemap.xml
sitemap https://www.caracoltv.com/author-sitemap-content.xml
sitemap https://www.caracoltv.com/index-sitemap.xml

Comments

  • sitemaps