canopy.is
robots.txt
Robots Exclusion Standard data for canopy.is
Resource Scan
Scan Details
Site Domain | canopy.is |
Base Domain | canopy.is |
Scan Status | Ok |
Last Scan | 2025-05-24T18:33:56+00:00 |
Next Scan | 2025-06-07T18:33:56+00:00 |
Last Scan
Scanned | 2025-05-24T18:33:56+00:00 |
URL | https://canopy.is/robots.txt |
Domain IPs | 104.26.10.181, 104.26.11.181, 172.67.70.45, 2606:4700:20::681a:ab5, 2606:4700:20::681a:bb5, 2606:4700:20::ac43:462d |
Response IP | 104.26.11.181 |
Found | Yes |
Hash | 49d3925c7fc93482a8845fb45efccd9d5d37da67627aa5478906ebc22486c91a |
SimHash | 614593534311 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /respondent/ |
Disallow | /session/new/ |
Disallow | /blog/wp-admin/ |
Disallow | /m/finish_registration/ |
Disallow | /m/training/c2fb788c7deedbeaa296e |
Other Records
Field | Value |
---|---|
sitemap | https://canopy.is/sitemap.xml |
sitemap | https://canopy.is/blog/sitemap_index.xml |