intercomcdn.com
robots.txt
Robots Exclusion Standard data for intercomcdn.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | intercomcdn.com |
Base Domain | intercomcdn.com |
Scan Status | Ok |
Last Scan | 2024-05-05T18:11:11+00:00 |
Next Scan | 2024-05-19T18:11:11+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-05T18:11:11+00:00 |
URL | https://intercomcdn.com/robots.txt |
Redirect | https://www.intercom.com/robots.txt |
Redirect Domain | www.intercom.com |
Redirect Base | intercom.com |
Domain IPs | 13.35.18.17, 13.35.18.48, 13.35.18.76, 13.35.18.85 |
Redirect IPs | 108.157.254.118, 108.157.254.18, 108.157.254.49, 108.157.254.99 |
Response IP | 108.157.254.99 |
Found | Yes |
Hash | a1e9870110d675f4ffc60a2e93c9a1ebc06084982368ea0b22d60fdf775532ba |
SimHash | 2f24b861ad93 |
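
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The function name `body_digest` is hypothetical and only illustrates how a digest like this could be used to detect changes between scans.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's Hash field is SHA-256 over the raw body bytes.
    The source does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against a freshly fetched body."""
    return body_digest(body) != previous_digest
```

Comparing the stored digest with the digest of a new fetch gives a cheap exact-change check. A SimHash, by contrast, is generally used to judge near-duplicate content.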
Groups
*
Rule | Path |
---|---|
Disallow | /assets/books |
Disallow | /early-stage/* |
Disallow | /blogtest |
Disallow | /beta/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.intercom.com/sitemap_index.xml |
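
As a rough illustration of how the `*` group above applies to request paths, the sketch below transcribes the four Disallow rules and checks paths against them. It assumes the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end, and rules otherwise match as prefixes). Since this group has no Allow rules, longest-match precedence is left out. The names `pattern_to_regex` and `is_disallowed` are hypothetical helpers, not part of any library.

```python
import re

# Disallow rules transcribed from the "*" group in the Groups table.
DISALLOW = ["/assets/books", "/early-stage/*", "/blogtest", "/beta/*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern matches as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for path in ["/", "/assets/books/guide.pdf", "/early-stage/intro", "/beta/"]:
        print(path, "disallowed" if is_disallowed(path) else "allowed")
```

Under these assumptions, `/early-stage/*` covers anything beneath `/early-stage/` but not the bare path `/early-stage`, while `/blogtest` also covers paths such as `/blogtest-2` because rules match as prefixes.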