jefferson.edu
robots.txt
Robots Exclusion Standard data for jefferson.edu
Resource Scan
Scan Details
Site Domain | jefferson.edu |
Base Domain | jefferson.edu |
Scan Status | Ok |
Last Scan | 2024-10-21T14:07:46+00:00 |
Next Scan | 2024-11-20T14:07:46+00:00 |
Last Scan
Scanned | 2024-10-21T14:07:46+00:00 |
URL | https://jefferson.edu/robots.txt |
Redirect | http://www.jefferson.edu/content/dam/academic/robots.txt |
Redirect Domain | www.jefferson.edu |
Redirect Base | jefferson.edu |
Domain IPs | 18.211.231.44, 34.234.162.145 |
Redirect IPs | 18.211.231.44, 34.234.162.145 |
Response IP | 18.211.231.44 |
Found | Yes |
Hash | 6f801d5aba702b4da7a9976c2374ddc0953ac077ebe928367fddab4b68e3fa58 |
SimHash | 9cf07f454918 |
Groups
*
Rule | Path |
---|---|
Disallow | /content/academic/new-university-DO-NOT-PUBLISH/ |
Disallow | /new-university-DO-NOT-PUBLISH/ |
Disallow | /dam/academic/test-folder/ |
Disallow | /dam/academic/test-folder/* |
Disallow | /dam/academic/path/to/specific/image.jpg |
Disallow | /dam/academic/email/ |
Disallow | /dam/academic/email/* |
Disallow | /dam/ist/orgcharts/ |
Disallow | /dam/ist/orgcharts/* |
Disallow | /dam/university/kanbar/ron-signature.gif |