duq.edu
robots.txt
Robots Exclusion Standard data for duq.edu
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | duq.edu |
Base Domain | duq.edu |
Scan Status | Ok |
Last Scan | 2024-10-19T20:02:10+00:00 |
Next Scan | 2024-11-18T20:02:10+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-19T20:02:10+00:00 |
URL | https://duq.edu/robots.txt |
Domain IPs | 192.88.240.53 |
Response IP | 192.88.240.53 |
Found | Yes |
Hash | 3fcab75ea167b618d19c38874de97be3db23fd24eb4cd3cb5309c867bc748da0 |
SimHash | f41154445792 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /_dev/ |
Disallow | /_training/ |
Disallow | /_migration/ |
Disallow | /*.xml$ |
Disallow | /academics/colleges-and-schools/music/downloads/index.php |
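The rules above mix plain path prefixes with one wildcard pattern (`/*.xml$`). The following is a minimal sketch of how a crawler might evaluate them, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical and are not part of the scan report or of any library.

```python
import re

# Disallow rules transcribed from the "*" group in the table above.
DISALLOW = [
    "/_dev/",
    "/_training/",
    "/_migration/",
    "/*.xml$",
    "/academics/colleges-and-schools/music/downloads/index.php",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: "*" is a wildcard and a trailing "$" means end of path;
    every other character is matched literally.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces, then join them with ".*" for each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the path.

    The group has no Allow rules, so a single match is enough.
    """
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for path in ["/_dev/page.html", "/sitemap.xml", "/sitemap.xml?page=2",
                 "/academics/index.php"]:
        print(path, "->", "blocked" if is_disallowed(path) else "allowed")
```

Under these assumptions `/sitemap.xml` is blocked while `/sitemap.xml?page=2` is not, because the `$` requires the path to end in `.xml`.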
Other Records
Field | Value |
---|---|
sitemap | https://www.duq.edu/sitemap.xml |
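The sitemap record can be read back with Python's standard `urllib.robotparser` module. This is a sketch only: `ROBOTS_TXT` is a reconstruction of the file from the tables in this report, so its exact text, ordering, and casing are assumptions rather than the bytes that were actually scanned.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's tables; the real file may differ in layout.
ROBOTS_TXT = """\
User-agent: *
Disallow: /_dev/
Disallow: /_training/
Disallow: /_migration/
Disallow: /*.xml$
Disallow: /academics/colleges-and-schools/music/downloads/index.php

Sitemap: https://www.duq.edu/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns a list of sitemap URLs, or None if absent.
sitemaps = parser.site_maps()
print(sitemaps)
```

Note that the sitemap is listed on the `www.duq.edu` host, while the scanned file was fetched from `duq.edu`.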