caracoltv.com
robots.txt
Robots Exclusion Standard data for caracoltv.com
Resource Scan
Scan Details
Site Domain | caracoltv.com |
Base Domain | caracoltv.com |
Scan Status | Ok |
Last Scan | 2024-10-29T23:27:08+00:00 |
Next Scan | 2024-11-05T23:27:08+00:00 |
Last Scan
Scanned | 2024-10-29T23:27:08+00:00 |
URL | https://caracoltv.com/robots.txt |
Redirect | https://www.caracoltv.com/robots.txt |
Redirect Domain | www.caracoltv.com |
Redirect Base | caracoltv.com |
Domain IPs | 35.165.175.243, 52.10.18.78 |
Redirect IPs | 3.165.102.119, 3.165.102.57, 3.165.102.68, 3.165.102.86 |
Response IP | 13.35.238.101 |
Found | Yes |
Hash | 083ecc9aacc45a5909a6b9bd63c61824b5190c02b2f2f899868125d149cb88d1 |
SimHash | 0f145b61e333 |
Groups
*
Rule | Path |
---|---|
Disallow | /_track |
Disallow | /pushnotifications/* |
Disallow | /instant-articles-ads |
Other Records
Field | Value |
---|---|
sitemap | https://www.caracoltv.com/sitemap.xml |
sitemap | https://www.caracoltv.com/section-sitemap.xml |
sitemap | https://www.caracoltv.com/tag-sitemap.xml |
sitemap | https://www.caracoltv.com/content-sitemap.xml |
sitemap | https://www.caracoltv.com/image-sitemap.xml |
sitemap | https://www.caracoltv.com/media-sitemap.xml |
sitemap | https://www.caracoltv.com/author-sitemap-content.xml |
sitemap | https://www.caracoltv.com/index-sitemap.xml |
Comments