verio.pancan.org
robots.txt
Robots Exclusion Standard data for verio.pancan.org
Resource Scan
Scan Details
Site Domain | verio.pancan.org |
Base Domain | pancan.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't establish SSL connection. |
Last Scan | 2025-03-11T04:45:53+00:00 |
Next Scan | 2025-06-09T04:45:53+00:00 |
Last Successful Scan
Scanned | 2023-10-25T18:42:37+00:00 |
URL | https://verio.pancan.org/robots.txt |
Redirect | https://pancan.org/robots.txt |
Redirect Domain | pancan.org |
Redirect Base | pancan.org |
Domain IPs | 40.122.65.162 |
Redirect IPs | 20.40.202.9 |
Response IP | 20.40.202.9 |
Found | Yes |
Hash | 2ec2ebd15d6d58dcb037ff4b89884462f11b2fff88695d57d1c0ce0106e76f34 |
SimHash | c095bd409033 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /email/ |
Disallow | /purpleride/ |
Disallow | /section_about/ |
Disallow | /section_facing_pancreatic_cancer/ |
Disallow | /section_get_involved/ |
Disallow | /section_stories/ |
Disallow | /timeforhope/ |
Disallow | /outreach/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.pancan.org/sitemap_index.xml |