Robots Exclusion Standard data for studentclearinghouse.info
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | studentclearinghouse.info |
Base Domain | studentclearinghouse.info |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-10-27T02:37:46+00:00 |
Next Scan | 2024-11-26T02:37:46+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-09-05T02:35:56+00:00 |
URL | https://studentclearinghouse.info/robots.txt |
Domain IPs | 108.157.254.30, 108.157.254.4, 108.157.254.42, 108.157.254.6 |
Response IP | 108.157.254.30 |
Found | Yes |
Hash | 62f14ae66a514cfa4ba77caf33d4cbbe674ca39e7146ed9f60f37ed69a1b3cd4 |
SimHash | 28146f5645d0 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /signature |
Allow | /snapshot |
Allow | /apps.php |
Allow | /audit/ |
Disallow | /docs/ |
Disallow | /admin-xml/ |
Disallow | /events/ |
Disallow | /docs/TO_tip_sheet.pdf |
Disallow | /docs/SSS_tip_sheet.pdf |
Disallow | /sitesearch/ |
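
The rules listed for this group can be checked against a path with a small evaluator. The following is a minimal sketch, assuming the longest-match semantics described in RFC 9309 (the most specific matching rule wins, Allow wins ties, and a path that matches no rule is allowed). It is not necessarily how the scanner or any particular crawler evaluates the file. The names `RULES` and `is_allowed` are hypothetical, and plain prefix matching is used only because none of the listed paths contain wildcards.

```python
from typing import List, Tuple

# Rules for the "*" group, copied from the Groups table in this report.
RULES: List[Tuple[str, str]] = [
    ("Allow", "/signature"),
    ("Allow", "/snapshot"),
    ("Allow", "/apps.php"),
    ("Allow", "/audit/"),
    ("Disallow", "/docs/"),
    ("Disallow", "/admin-xml/"),
    ("Disallow", "/events/"),
    ("Disallow", "/docs/TO_tip_sheet.pdf"),
    ("Disallow", "/docs/SSS_tip_sheet.pdf"),
    ("Disallow", "/sitesearch/"),
]


def is_allowed(path: str, rules: List[Tuple[str, str]] = RULES) -> bool:
    """Return True if `path` may be crawled under longest-match evaluation.

    Assumption: rule paths are plain prefixes (no '*' or '$' wildcards),
    which holds for the rules listed in this report.
    """
    best_len = -1      # length of the most specific matching rule so far
    allowed = True     # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = kind == "Allow"
        # Prefer the longer prefix; on equal length, prefer Allow.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ["/audit/report", "/docs/TO_tip_sheet.pdf", "/docs", "/events/2024"]:
        print(p, is_allowed(p))
```

Under these assumptions, the two PDF rules do not change the outcome, because any path beginning with `/docs/` is already disallowed by the shorter `/docs/` rule. A path such as `/docs` (no trailing slash) matches no rule and is therefore allowed.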