Robots Exclusion Standard data for clemson.edu

Resource Scan

Scan Details

Site Domain clemson.edu
Base Domain clemson.edu
Scan Status Ok
Last Scan 2024-10-20T01:37:08+00:00
Next Scan 2024-11-19T01:37:08+00:00

Last Scan

Scanned 2024-10-20T01:37:08+00:00
URL https://clemson.edu/robots.txt
Redirect https://www.clemson.edu/robots.txt
Redirect Domain www.clemson.edu
Redirect Base clemson.edu
Domain IPs 130.127.204.30, 2620:103:a004:36::30
Redirect IPs 130.127.204.30, 2620:103:a004:36::30
Response IP 130.127.204.30
Found Yes
Hash 816d5558ec3a5a0df19bb4f400fbe47f08ce37fcb2b1ae2baaae8511d6a97492
SimHash 6d05c9c5e1f1

Groups

*

Rule Path
Disallow *.html/

googlebot

Rule Path
Disallow /_ows_includes/php/
Disallow /_ows_includes/temp/
Disallow /*sn-config.html
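
As a rough illustration of how the two groups above might be applied, the following minimal Python sketch checks a path against the listed Disallow rules. It rests on several assumptions that are not part of the scan data: `*` is treated as a wildcard for any run of characters, a trailing `$` is treated as an end anchor, a crawler that has its own group ignores the `*` group, and user-agent names are compared by exact lowercase match. The names `GROUPS`, `pattern_to_regex`, and `is_disallowed` are hypothetical helpers, and real crawlers may interpret a rule such as `*.html/` (which does not begin with `/`) differently.

```python
import re

# Disallow rules as listed in the scan, keyed by user-agent group.
GROUPS = {
    "*": ["*.html/"],
    "googlebot": [
        "/_ows_includes/php/",
        "/_ows_includes/temp/",
        "/*sn-config.html",
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(user_agent, path):
    """Return True if any rule in the agent's group matches the path.

    Assumption: an agent with its own group uses only that group;
    every other agent falls back to the '*' group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, like a prefix match.
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("googlebot", "/_ows_includes/php/x.php"),
        ("googlebot", "/dept/sn-config.html"),
        ("otherbot", "/index.html/"),
        ("otherbot", "/index.html"),
    ]:
        print(agent, path, is_disallowed(agent, path))
```

Under these assumptions, the `*` group only blocks paths containing `.html/` (with the trailing slash), so an ordinary page such as `/index.html` remains allowed for generic crawlers.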