claremont.edu
robots.txt
Robots Exclusion Standard data for claremont.edu
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | claremont.edu |
Base Domain | claremont.edu |
Scan Status | Ok |
Last Scan | 5/25/2025, 7:28:42 AM |
Next Scan | 6/24/2025, 7:28:42 AM |
Last Scan
Field | Value |
---|---|
Scanned | 5/25/2025, 7:28:42 AM |
URL | https://claremont.edu/robots.txt |
Redirect | https://www.claremont.edu/robots.txt |
Redirect Domain | www.claremont.edu |
Redirect Base | claremont.edu |
Domain IPs | 104.198.102.253 |
Redirect IPs | 104.198.102.253 |
Response IP | 104.198.102.253 |
Found | Yes |
Hash | 05edb2450b171e98da0f077e633773101effcd252897153657afe69f2e6ec4b0 |
SimHash | 6100dc104bb2 |
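The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch of how such a fingerprint could be computed follows, assuming SHA-256 over the raw response bytes. The `content_hash` helper and the sample body are hypothetical, so the digest it produces is not expected to match the value in the table.

```python
import hashlib

def content_hash(body: bytes) -> str:
    """Return a hex SHA-256 digest of the raw robots.txt bytes.

    Assumption: the report's Hash field is SHA-256; the page does not say so.
    """
    return hashlib.sha256(body).hexdigest()

# Hypothetical sample body, not the exact bytes served by claremont.edu.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = content_hash(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

Comparing the digest from one scan with the next is a cheap way to detect that the file changed at all. The SimHash field presumably serves the complementary purpose of measuring how similar two versions are, but its algorithm is not documented here.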
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
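The two rules above overlap: `/wp-admin/admin-ajax.php` matches both the Disallow and the Allow path. Under the longest-match precedence described in RFC 9309, the more specific rule wins, and Allow wins a tie. The sketch below illustrates that evaluation for plain path prefixes only (no `*` or `$` wildcards). The `is_allowed` helper and the `/admissions/` test path are hypothetical and not part of the scan.

```python
# Rules from the "*" group in the table above, as (rule, path-prefix) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_rule, best_len = "allow", -1  # no matching rule means allowed
    for rule, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and rule == "allow"
        if longer or tie_for_allow:
            best_rule, best_len = rule, len(prefix)
    return best_rule == "allow"

for p in ("/wp-admin/options.php", "/wp-admin/admin-ajax.php", "/admissions/"):
    print(p, is_allowed(p))
```

With these rules, `/wp-admin/options.php` is blocked, while `/wp-admin/admin-ajax.php` and any path outside `/wp-admin/` remain crawlable.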
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
sitemap | https://www.claremont.edu/wp-sitemap.xml |
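A crawler can read these non-rule records with Python's standard `urllib.robotparser`. The sketch below parses a robots.txt reconstructed from the records in this report. The exact layout of the live file, including the placement of `Crawl-delay` inside the `*` group, is an assumption, and `ExampleBot` is a hypothetical user agent. Crawl-delay is conventionally read as seconds between requests, though not every crawler honors it. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scan data; the real file's
# ordering, casing, and whitespace may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
Crawl-delay: 10

Sitemap: https://www.claremont.edu/wp-sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is hypothetical; it falls back to the "*" group.
delay = parser.crawl_delay("ExampleBot")
sitemaps = parser.site_maps()

print("crawl delay:", delay)
print("sitemaps:", sitemaps)
```

To work against the live file instead, `parser.set_url("https://www.claremont.edu/robots.txt")` followed by `parser.read()` would replace the `parse` call.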