caracoltv.co
robots.txt

Robots Exclusion Standard data for caracoltv.co

Resource Scan

Scan Details

Site Domain caracoltv.co
Base Domain caracoltv.co
Scan Status Ok
Last Scan2024-06-05T09:09:06+00:00
Next Scan 2024-06-12T09:09:06+00:00

Last Scan

Scanned2024-06-05T09:09:06+00:00
URL https://caracoltv.co/robots.txt
Redirect https://www.caracoltv.com/robots.txt
Redirect Domain www.caracoltv.com
Redirect Base caracoltv.com
Domain IPs 52.34.221.251, 52.39.22.168
Redirect IPs 3.165.102.119, 3.165.102.57, 3.165.102.68, 3.165.102.86
Response IP 3.165.102.119
Found Yes
Hash 083ecc9aacc45a5909a6b9bd63c61824b5190c02b2f2f899868125d149cb88d1
SimHash 0f145b61e333

Groups

*

Rule Path
Disallow /_track
Disallow /pushnotifications/*
Disallow /instant-articles-ads

grapeshot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.caracoltv.com/sitemap.xml
sitemap https://www.caracoltv.com/section-sitemap.xml
sitemap https://www.caracoltv.com/tag-sitemap.xml
sitemap https://www.caracoltv.com/content-sitemap.xml
sitemap https://www.caracoltv.com/image-sitemap.xml
sitemap https://www.caracoltv.com/media-sitemap.xml
sitemap https://www.caracoltv.com/author-sitemap-content.xml
sitemap https://www.caracoltv.com/index-sitemap.xml

Comments

  • sitemaps