clemson.edu
robots.txt
Robots Exclusion Standard data for clemson.edu
Resource Scan
Scan Details
Site Domain | clemson.edu |
Base Domain | clemson.edu |
Scan Status | Ok |
Last Scan | 2024-10-20T01:37:08+00:00 |
Next Scan | 2024-11-19T01:37:08+00:00 |
Last Scan
Scanned | 2024-10-20T01:37:08+00:00 |
URL | https://clemson.edu/robots.txt |
Redirect | https://www.clemson.edu/robots.txt |
Redirect Domain | www.clemson.edu |
Redirect Base | clemson.edu |
Domain IPs | 130.127.204.30, 2620:103:a004:36::30 |
Redirect IPs | 130.127.204.30, 2620:103:a004:36::30 |
Response IP | 130.127.204.30 |
Found | Yes |
Hash | 816d5558ec3a5a0df19bb4f400fbe47f08ce37fcb2b1ae2baaae8511d6a97492 |
SimHash | 6d05c9c5e1f1 |
Groups
*
Rule | Path |
---|---|
Disallow | *.html/ |