openstaxcollege.org
robots.txt
Robots Exclusion Standard data for openstaxcollege.org
Resource Scan
Scan Details
Site Domain | openstaxcollege.org |
Base Domain | openstaxcollege.org |
Scan Status | Ok |
Last Scan | 2024-10-18T22:07:26+00:00 |
Next Scan | 2024-11-17T22:07:26+00:00 |
Last Scan
Scanned | 2024-10-18T22:07:26+00:00 |
URL | https://openstaxcollege.org/robots.txt |
Redirect | https://openstax.org/robots.txt |
Redirect Domain | openstax.org |
Redirect Base | openstax.org |
Domain IPs | 13.33.88.18, 13.33.88.85, 13.33.88.88, 13.33.88.99 |
Redirect IPs | 108.156.133.119, 108.156.133.69, 108.156.133.82, 108.156.133.91 |
Response IP | 108.156.133.69 |
Found | Yes |
Hash | 254f55cf69abe88ba64e4e7c9308c44fd7b6627d70f77165594cdbee2d3c4068 |
SimHash | f5045540a517 |
Groups
*
Rule | Path |
---|---|
Disallow | /accounts |
Disallow | /admin |
Disallow | /l/ |
Disallow | /r/ |
Disallow | /confirmation/ |
Disallow | /adoption-confirmation |
Disallow | /general |
Disallow | /contents |
Disallow | /extras |
Disallow | /errata |
Disallow | /resources |
Disallow | /apps/archive |
Disallow | /apps/archive-preview |
Disallow | /apps/cms/api/spike |
Other Records
Field | Value |
---|---|
sitemap | https://openstax.org/sitemap.xml |
sitemap | https://openstax.org/rex/sitemaps/index.xml |