uaptc.edu
robots.txt
Robots Exclusion Standard data for uaptc.edu
Resource Scan
Scan Details
Site Domain | uaptc.edu |
Base Domain | uaptc.edu |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-03-13T13:30:08+00:00 |
Next Scan | 2024-06-11T13:30:08+00:00 |
Last Successful Scan
Scanned | 2023-10-23T10:09:45+00:00 |
URL | https://uaptc.edu/robots.txt |
Domain IPs | 104.26.8.242, 104.26.9.242, 172.67.69.46, 2606:4700:20::681a:8f2, 2606:4700:20::681a:9f2, 2606:4700:20::ac43:452e |
Response IP | 172.67.69.46 |
Found | Yes |
Hash | 7bc45a1b81f1f1418741732d84b32a5c1374c73d390c3e3c61745a33081f3e3a |
SimHash | 11516849cf31 |
Groups
*
Rule | Path | Comment |
---|---|---|
Disallow | /docs/default-source/* | Block the doc directory. |
Disallow | Block pdf files. Non-standard but works for major search engines | |
Disallow | /images/default-source/* | Block images directory |