depaul.edu
robots.txt
Robots Exclusion Standard data for depaul.edu
Resource Scan
Scan Details
Site Domain | depaul.edu |
Base Domain | depaul.edu |
Scan Status | Ok |
Last Scan | 2024-11-14T05:09:29+00:00 |
Next Scan | 2024-12-14T05:09:29+00:00 |
Last Scan
Scanned | 2024-11-14T05:09:29+00:00 |
URL | https://depaul.edu/robots.txt |
Redirect | https://www.depaul.edu/robots.txt |
Redirect Domain | www.depaul.edu |
Redirect Base | depaul.edu |
Domain IPs | 140.192.178.90, 216.220.184.90, 2620:0:2250:3b10::90, 2620:0:2250:4b10::90 |
Redirect IPs | 216.220.184.50, 2620:0:2250:3b10::50 |
Response IP | 216.220.184.50 |
Found | Yes |
Hash | 92a588f6881d6d509c2897cf419a378bbcb13ca3a1e26fa98bbcd954726636e5 |
SimHash | 79256d04cdd2 |
Groups
*
Rule | Path |
---|---|
Disallow | /Search/ |
Disallow | /ReusableContent/ |
Disallow | /Reports%20List/ |
Disallow | /WorkflowTasks/ |
Disallow | /SiteCollectionDocuments/ |
Disallow | /SiteCollectionImages/ |
Disallow | /Documents/Forms/ |
Disallow | /Pages/Forms/ |
Disallow | /university-catalog-archive/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.depaul.edu/sitemap.xml |