studentclearinghouse.info
robots.txt

Robots Exclusion Standard data for studentclearinghouse.info

Resource Scan

Scan Details

Site Domain studentclearinghouse.info
Base Domain studentclearinghouse.info
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-27T02:37:46+00:00
Next Scan 2024-11-26T02:37:46+00:00
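
A "client error" here means an HTTP 4xx response to the robots.txt request. As a rough sketch of how a crawler might react, the function below follows the general guidance of RFC 9309 (4xx: the file is treated as unavailable and crawling may proceed; 5xx: the file is treated as unreachable and crawling should stop). The function name and the returned labels are hypothetical, and the report does not say which 4xx code was returned, so the 404 in the usage lines is only an illustration.

```python
def robots_policy_for_status(status: int) -> str:
    """Map the HTTP status of a robots.txt fetch to a crawl policy label.

    The labels are illustrative, not part of any standard API.
    """
    if 200 <= status < 300:
        return "parse"         # body is available: parse and obey the rules
    if 400 <= status < 500:
        return "allow_all"     # client error: robots.txt treated as unavailable
    if 500 <= status < 600:
        return "disallow_all"  # server error: robots.txt treated as unreachable
    return "undefined"         # redirects and other codes need separate handling


# Hypothetical status codes, for illustration only.
print(robots_policy_for_status(404))
print(robots_policy_for_status(503))
```

Individual crawlers may apply stricter rules for particular codes such as 401 or 403, so this mapping should be read as a baseline rather than a description of any one crawler.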

Last Successful Scan

Scanned 2024-09-05T02:35:56+00:00
URL https://studentclearinghouse.info/robots.txt
Domain IPs 108.157.254.30, 108.157.254.4, 108.157.254.42, 108.157.254.6
Response IP 108.157.254.30
Found Yes
Hash 62f14ae66a514cfa4ba77caf33d4cbbe674ca39e7146ed9f60f37ed69a1b3cd4
SimHash 28146f5645d0

Groups

*

Rule Path
Allow /signature
Allow /snapshot
Allow /apps.php
Allow /audit/
Disallow /docs/
Disallow /admin-xml/
Disallow /events/
Disallow /docs/TO_tip_sheet.pdf
Disallow /docs/SSS_tip_sheet.pdf
Disallow /sitesearch/
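
The group above can be checked against concrete paths with Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt text is reconstructed from the rule table (the original file's exact formatting is not shown in this report), and the helper `is_allowed` and the agent name `ExampleBot` are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above; the real file may differ in layout.
ROBOTS_TXT = """\
User-agent: *
Allow: /signature
Allow: /snapshot
Allow: /apps.php
Allow: /audit/
Disallow: /docs/
Disallow: /admin-xml/
Disallow: /events/
Disallow: /docs/TO_tip_sheet.pdf
Disallow: /docs/SSS_tip_sheet.pdf
Disallow: /sitesearch/
"""

BASE_URL = "https://studentclearinghouse.info"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse in memory, no network request


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, BASE_URL + path)


for path in ("/signature", "/audit/", "/docs/TO_tip_sheet.pdf", "/sitesearch/", "/"):
    print(path, is_allowed(path))
```

Paths that match no rule, such as `/`, are allowed by default. The two PDF entries are already covered by `Disallow /docs/`, so they do not change the outcome for any path.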